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Introduction to ARL 


The Army Research Laboratory of the U.S. Army Research, Development and Engineering Command (RDECOM) is the Army’s 
corporate laboratory. ARL’s research continuum focuses on basic and applied research (6.1 and 6.2) and survivability/lethality 
and human factors analysis (6.6). ARL also applies the extensive research and analysis tools developed in its direct mission 
program to support ongoing development and acquisition programs in the Army Research, Development and Engineering 
Centers (RDECs), Program Executive Offices (PEOs)/Program Manager (PM) Offices, and Industry. ARL has consistently provided 
the enabling technologies in many of the Army’s most important weapons systems. 

The Soldiers of today and tomorrow depend on us to deliver the scientific discoveries, technological advances, and the analyses 
that provide Warfighters with the capabilities to execute full-spectrum operations. ARL has Collaborative Technology Alliances in 
Micro Autonomous Systems and Technology, Robotics, Cognition and Neuroergonomics, and Network Science, an International 
Technology Alliance, and new Collaborative Research Alliances in Multiscale Multidisciplinary Modeling of Electronic Materials 
and Materials in Extreme Dynamic Environments. ARL’s diverse assortment of unique facilities and dedicated workforce of 
government and private sector partners make up the largest source of world class integrated research and analysis in the Army. 

ARL Mission 

The mission of ARL is to “Provide the underpinning science, technology, and analysis that enable full-spectrum operations.” 

Our Vision 

America’s Laboratory for the Army: Many Minds, Many Capabilities, Single Focus on the Soldier 


ARL’s Organization 

•Army Research Office (ARO) - Initiates the scientific and far reaching technological discoveries in extramural organizations: 
educational institutions, nonprofit organizations, and private industry. 

•Computational and Information Sciences Directorate (CISD) - Scientific research and technology focused on information 
processing, network and communication sciences, information assurance, battlespace environments, and advanced computing 
that create, exploit, and harvest innovative technologies to enable knowledge superiority for the Warfighter. 

• Human Research and Engineering Directorate (HRED) - Scientific research and technology directed toward optimizing Soldier 
performance and Soldier-machine interactions to maximize battlefield effectiveness and to ensure that Soldier performance 
requirements are adequately considered in technology development and system design. 

•Sensors and Electron Devices Directorate (SEDD) - Scientific research and technology in electro-optic smart sensors, 
multifunction radio frequency (RF), autonomous sensing, power and energy, and signature management for reconnaissance, 
intelligence, surveillance, target acquisition (RISTA), fire control, guidance, fuzing, survivability, mobility, and lethality. 

•Survivability/Lethality Analysis Directorate (SLAD) - Integrated survivability and lethality analysis of Army systems and 
technologies across the full spectrum of battlefield threats and environments as well as analysis tools, techniques, and 
methodologies. 

• Vehicle Technology Directorate (VTD) - Scientific research and technology addressing propulsion, transmission, aeromechanics, 
structural engineering, and robotics technologies for both air and ground vehicles. 


•Weapons and Materials Research Directorate (WMRD) - Scientific research and technology in the areas of weapons, protection, 
and materials to enhance the lethality and survivability of the Nation’s ground forces. 


ARL Workforce in 2013 

• 1,971 Civilians - 41 Military 

• 1,181 Research Performing Workforce 

• 578 (49%) hold PhDs 

• 13 STs/26 ARL Fellows 

ARL’s Primary Sites 

• Aberdeen Proving Ground, MD 

• Adelphi Laboratory Center, MD 

• White Sands Missile Range, NM 

• Raleigh-Durham, NC 

• Orlando, FL 

Visit ARL’s web site at www.arl.army.mil 






Unique ARL facilities and modeling capabilities 
provide our scientists and engineers access to 
world-class research centers. 
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FOREWORD 

Welcome to the second edition of the Research @ARL monograph series. In this 
edition, we share ARL’s leading edge research in the Network Sciences. Today’s 
society is more networked than ever before, and today’s Army reflects that reality. 

However, the tools needed to understand the science of our interconnected global 
society are not yet fully developed. Many of the networks that touch our everyday 
lives, both in the Army and outside, are complex networks. They are often governed 
by local interactions and robust self-organization principles, leading to complex 
dynamics. And networks do not live in isolation; they are interconnected in surprising 
ways. A modern Army cannot function—any more than a modern city can—without the 
latticework of transportation systems, utilities, water and food distribution systems, 
communication webs, health care delivery organizations, and a myriad of other 
interconnected, complex networks. These layers of interpenetrating networks paint 
a picture of today’s Army that overarches the traditional physical sciences, social 
sciences, life sciences, and information sciences. No one discipline or collection of 
disciplines can fully embrace the spectrum of complex problems that the engineered 
networking of humanity presents. 

ARL’s research program in the Network Sciences capitalizes on ARL’s expertise in basic and applied research, ranging across 
a multitude of science areas, while simultaneously drawing upon the talents of others, nationally and internationally, through 
collaborative alliances with industry, laboratories, and academia. We seek to discover the underlying mathematical principles 
and universal laws that govern the behavior of co-evolving multi-genre networks. We also seek to understand emergent behavior 
in such networks, leading to mechanisms for thwarting or accelerating the emergence of interesting behaviors. Our research 
teams are examining the complex neurological networks within the human brain to determine how individuals make decisions 
and perform other cognitive tasks under varying levels of stress and uncertainty. The insight gained from this research is 
being applied to other biological networks to alleviate the injuries and pathologies encountered by the Warfighter. Exploration 
of cooperative behavior of social networks, such as swarms, is a significant research opportunity to control the mobility of 
collections of autonomous platforms. In addition to networks in the life and social sciences, ARL teams are examining the 
connectedness within and between communications and information networks to determine how their complex structures can 
best facilitate the gathering of intelligence and the formation of collective decisions. 

As we investigate the fundamental science addressing these critical applications, we search for laws and principles of Network 
Science that parallel and compliment those in physics. Tentative principles are presently guiding the development of the 
mathematical infrastructure necessary for understanding those aspects of complex networks common to multiple phenomena 
appearing in a variety of scientific disciplines. 

ARL’s long term commitment to encompass interdisciplinary sciences in such diverse areas as vulnerability assessment, 
cognitive sciences, human sciences, electronic devices, weapons and materials sciences, information sciences, survivability, 
and lethality, uniquely positions our teams to investigate and discover previously unrealized commonalities to provide innovative 
solutions. Our discoveries enhance our country’s technological capabilities, our national security, and, ultimately, enable the 
invincible Soldier of the future. 



Dr. John Pellegrino 

Director (A), U.S. Army Research Laboratory 
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Introduction to ARL Research in Network Sciences 
BJ.West, A. Swami, J.N. Mait, and PJ. Franaszczuk 


1 Army Research Laboratory: Network Science Program 

The objective of the Network Science Program of the U.S. Army Research Laboratory (ARL) is to perform foundational research 
on network science (NS) leading to a fundamental understanding of the interplay within and among the physical, social/ 
cognitive, information, and communication networks. This research is expected to lead to insights on how processes and 
parameters in one network affect and are affected by those in other networks. The underlying long-term goal is to optimize 
human performance and to greatly enhance speed and precision for complex networked military operations. 

We define Network Science as the study of complex systems whose behavior and responses are determined by exchanges 
and interactions between subsystems across a possibly dynamic and usually poorly defined set of pathways. The fundamental 
components of a network are its structure (nodes and links or pathways) and its dynamics, which together specify the network’s 
properties (functions and behaviors). 

The Army is embarked upon an information age transformation loosely referred to as Network Centric Warfare (NCW) or 
Network Centric Operations (NCO). NCW/NCO seeks to dramatically increase mission effectiveness via robust networking for 
information sharing leading to shared situational awareness, improved collaboration and self-synchronization, and enhanced 
sustainability and speed of command [1, 2]. Central to this effort is an understanding of the interaction of networks in the 
physical, informational, cognitive, and social domains; thus, ARL’s NS program directly supports NCW/NCO. 

Society, as pointed out in the 2005 National Research Council (NRC) report Network Science [3], is more interconnected 
than it has been at any time in world history. A modern city could not function without garbage collection, sewers, electricity, 
transportation, water, health care, and food and fuel distribution, and would have a much different form without networks of 
education, banking, telephone service, and the Internet. Some of these activities form local physical or social networks within 
the city and their forms have been evolving for millennia. Part of that evolution was the development of their interoperability such 
that these networks are all interconnected and in one way or another they connect to ever-expanding global networks. These 
interconnections can lead to robustly coevolving networks, but under some conditions, can also lead to cascading failures. This 
corresponds to the first finding of the NRC report: “Networks enable the necessities and conveniences of modern life.” These 
are the engineered networks of humanity, but there are comparable structures in the biosphere and ecosphere involving plant 
and animal networks of tremendous variety. 

Moreover, our internal world is also an interlacing and interacting collection of networks. The neuronal network carrying the 
brain’s signals to the body’s physiological networks is even more complex than the modern city. The bio/eco-networks are 
certainly as difficult to understand as the physical/social-networks. This led to another finding in the NRC report: “Social and 
biological networks bear important similarities to engineered networks.” Herein we present a number of studies carried out by 
ARL scientists in collaboration with the broader scientific community to further our understanding of NS. These studies suggest 
how these similarities might be exploited for the development of NS that is of value to society in general and to the United States 
Army in particular. 

The insights and mathematical characterizations resulting from ARL’s NS research will enable us to achieve this underlying 
goal, by providing the ability to predict and control the individual and composite behavior of the complex interacting physical, 
communications, information, and social/cognitive networks. Network science is one of ARL’s strategic technology investment 
areas, with a strong program based on the modern research pillars of theory, simulation, emulation, and experimentation. It 
encompasses a variety of transdisciplinary basic and applied programs across ARL’s core competency areas that include close 
collaborations with academic, military, and industrial research partners throughout the US and internationally. These include 
ARL’s Network Science Collaborative Technology Alliance (CTA), the US-UK Network and Information Sciences International 
Technology Alliance (ITA), the Micro Autonomous Systems & Technology CTA, the Cognition & Neuroergonomics CTA, and several 
Multidisciplinary University Research Initiatives (MURIs). The Institute for Collaborative Biotechnology is a University Affiliated 
Research Center (UARC) that approaches networks from the perspective of biological systems, and ARL’s Mobile Network 
Modeling Institute (MNMI) is an applied research effort that seeks to further the science by enabling full fidelity at-scale 
modeling and simulation of large scale heterogeneous networks, incorporating live assets in the emulation environment. More 
information on these programs may be found by navigating ARL’s Collaborative Technology and Research Alliances website at 
http://www.arl.army.mil/www/default.cfm?page=93. 
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The taxonomy of the ARL research efforts in NS can take a number of forms, two of which are given here. To set the stage for 
the scientific problems, we summarize four grand challenges and identify the Army programs conducting research to address 
these challenges. A separate and distinct way of organizing the research is via the network category addressed. The individual 
research papers are identified in this latter grouping according to discipline. 

2 Grand Challenges 

The ARL research effort in NS seeks to make fundamental advances of and contributions to the science so as to enable the a 
priori prediction of the behaviors of complex interacting networks in diverse and dynamic environments and an understanding 
of the design trade-offs and the impacts of various technologies under a wide variety of dynamic and adverse conditions. 
Moreover, it is intended to quantify the impact of network technologies both technically and operationally to make informed 
acquisition decisions. 

2.1 Underpinning Mathematical Framework: Dynamic Multiple-Scale, Cross-Genre Coupling 

Multiple-scale and cross-scale couplings within and across networks are recurrent themes in the study of complex physical, 
communication, informational, social, and biological networks. Existing formalisms address one, two, or a continuum of scales, 
but the multi-node problem remains unsolved. Existing mathematical approaches offer insights into the quantitative aspects 
of complex phenomena with multiple scales (no characteristic scale) in stationary and near equilibrium networks, but what 
is needed now is a way to describe the dynamics of such complex phenomena/networks when the underlying processes 
are neither stationary nor near equilibrium [4]. Network characteristics cannot be deduced from the properties of individual 
components; they emerge during the formation and growth of the network. Consequently a mathematical framework is needed 
to characterize the interactions between the dynamic network components, the temporal evolution of the network, and its 
response to external stimuli, including attacks. The framework must take into account heterogeneity, non-stationarity, even 
non-ergodicity, as well as conflicting constraints and objectives. 

One way to characterize the interaction of networks is by understanding how information is propagated within and across 
complex adaptive networks — how information, in a general sense, flows across networks so as to maximize utility (e.g., as 
perceived by a decision maker) under constraints, such as timeliness and resource usage. Tools from statistical physics such 
as continuum percolation theory, stochastic geometry, and modern statistics are used to deal with non-Markovian behaviors, 
heavy-tailed phenomena, and the mutual interactions between networks. To cope with multiple potentially conflicting constraints 
and objectives, a generalized control theory of locally reacting multiple agents needs to be developed, perhaps through the 
mimicking of self-repair and replication identified in biological networks. 

2.2 Human Metrics in Complex Networks 

A critical element in realizing the vision of Network-Centric Operations (NCO) is that the distributed decision making process 
must take into account the human dimension in its elements including decision-makers, data and information sources, 
processed information, communication network elements, etc. The measure or metric of the human dimension must be 
derived in a distributed fashion; in time-critical and stressful situations; in environments where node capture and subversion 
are likely; where the underlying communications network is resource-constrained, mobile, and dynamic; and where decision- 
makers’ reliance on and compliance with an information network are subject to numerous internal and external influences. 
Key barriers include the limited understanding of network-of-networks, composability of heterogeneous networks, and a limited 
understanding of how the context (e.g., trust and risk) is analytically derived from variously distributed sources of evidence. 

A key aspect of a common quantitative framework is an approach for representing uncertainty (probability theory, possibility 
theory, evidence theory, subjective logic, argumentation theory, or other). Development of metrics is a key issue: metrics that 
take into account the value of information and human intentions are important in information and C2 networks. Relations 
between and among these metrics need to be clarified along with the development of a mathematics of network classes. A 
specific example is the trust metric. The scientific challenge is to understand the different definitions and dimensions of trust, 
for example, in socio-cognitive and communications networks, and from that understanding develop a composite trust metric. 
Rather than merely treating the interaction via a simple interface or utility function offered by one network to another, network 
interactions must be taken into account, as well as the context- and time-varying nature of the trust metric components. 
Encapsulation of these sub-components as a scalar metric is challenging due to the conflicting nature of the components and 
the non-convexity of useful metrics. To quantify human performance in networks, measurable features of data from different 
networks need to be identified to enable the modeling of network complexity and the interactions between different networks. 
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2.3 Abstracting Common Concepts Across Fields 

A set of common concepts for network science that is defined consistently across the various scientific disciplines does not 
exist. There are discipline-specific nomenclatures that have been developed for specialized needs, but the equivalence of 
these terminologies for a discipline-independent characterization of networks has not been established. The mathematical 
formulation of these empirically-based concepts will provide a nascent language for network science. 

Mathematical rigor across the component disciplines is promoted by focusing on ubiquitous aspects of the underlying complex 
networks, such as the appearance of non-stationary, non-ergodic, and non-renewal statistical processes. These properties 
are manifest through the empirical inverse power-law (IPL) probability density functions (pdf) that appear in physical, social, 
biological, economic, and physiological networks. Tools from non-equilibrium statistical physics and non-equilibrium information 
theory are used as well. To understand and facilitate network agility and adaptability, new techniques, methodologies, and 
analyses from a multi-disciplinary approach have been and will continue to be combined with the application of powerful 
modeling and simulation to explore the dynamics of networks. High-fidelity simulation, emulation, and experiments for large- 
scale dynamic networks — based on complexity-performance trades and incorporation of live assets in an emulation environment 
— will provide us the ability to understand the complicated interactions between large-scale heterogeneous networks. 

3 Taxonomy by Discipline 

NS is unlike traditional scientific disciplines in that it identifies a particular structure, that of a network, as the basis for the 
science. Since structural webbing exists within and among all scientific disciplines, the over-arching goal of NS is to develop 
a body of principles that will enable modeling, design, analysis, prediction, and control of the behavior of the underlying 
phenomena in the physical (sensors, wireless web), information (communications, knowledge management), social (people, 
organizations, cultures), and cognitive (perceptions, beliefs, decision-making) sciences, to name only a few. The emphasis is 
not restricted to the network structure within a given discipline, but is focused on networked phenomena that encompass many 
disciplines as well as how such complex networks interact with one another. As a nascent science, the underlying principles 
are still missing, so the grand challenge for this effort is to create and organize the fundamental knowledge upon which to 
base these principles. Moreover, NS is intended to bridge the knowledge gap between disciplines and breakdown the artificial 
barriers that have been created between disciplines. 

We have partitioned the scientific papers into four sections, each corresponding to a different category of disciplines: the 
physical sciences, the life sciences, the social sciences and the information sciences. This partitioning is intended to direct the 
reader to that area of investigation with which s/he is most comfortable. On the other hand, this opening narrative is intended 
to suggest that NS is more than just a thematic thread that weaves its way through each of the sections; it is intended as an 
over-arching perspective that encompasses the traditional disciplines. However, in the rush to discuss the tangible aspects of 
science, the fieldable products of technology, and the immediate needs of the Army, the network view so crucial to the scientific 
enterprise is often overlooked. It is emphasized here because, like the scientific barrier of complexity, it often represents the 
problem we would like to solve, but do not know how. 

3.1 Physical Sciences 

Physics has been the paradigm of science since the time of Newton. Consequently, when investigators wanted to emphasize 
the use of quantitative methods in their studies, they appended the suffix physics. In the 19th century this strategy resulted in 
sociology giving rise to sociophysics, psychology spawning psychophysics and biology producing biophysics. Of course, there 
were also a number of physicists speculating on the use of their techniques to understand non-physical phenomena. As a 
result, new disciplines evolved at the boundary between previously established disciplines. This way of extending the frontiers of 
disciplinary science proceeded into the 20th century, producing biochemistry, bioengineering, social engineering, econophysics, 
and so on. But it was the successes of the mathematical methods of physics that attracted the most attention. 

The existence of an empirical law or principle from which the dynamics of a physical system can be determined appears almost 
mystical when trying to understand cognition or decision making. The 19th century physicists and applied mathematicians were 
able to take the conservation of energy and from that statement determine the laws of motion for any mechanical system. Such 
variational calculations in the hands of more practical people subsequently produced control theory and the dynamics of non¬ 
mechanical systems consistent with a set of imposed constraints. 

Sardellitti and Barbarossa of Sapienza-University of Rome collaborated with Swami of ARL on, “Optimal Topology Control 
and Power Allocation for Minimum Energy Consumption in Consensus Networks” (page 21), to design a network topology 
that optimizes the spread of information, exemplified by the computation of globally relevant statistics by exploiting only local 
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communication among the sensors in a wireless network. As in all design problems, there is a function to be carried out by the 
network while simultaneously satisfying a given constraint such as maximizing efficiency or minimizing energy consumption. Is 
the optimal topology sparse or dense? The research of Sardellitti et al. was concerned with minimizing energy use; the paper 
of Hollingshad et al., discussed later, was concerned with network efficiency, “A New Measure of Network Efficiency" (page 
13). Sardellitti et al. developed a method for optimizing the network topology and the power allocation across every link in 
order to minimize the energy necessary to achieve consensus across the sensor network. Since the topology is the result of 
optimization, this is a non-deterministic polynomial time (NP) combinatorial problem which the authors circumvent by means 
of a clever relaxation procedure. 

Another approach to handling network complexity is by recognizing that the elements comprising a fighting force are interlinked 
to such a high degree that changing a single component influences and modifies the response of every other component in 
the network. Furthermore, each element in the network is itself a network and must be treated as such. This observation does 
not in itself contribute anything new to the discussion. It has been known since the first recorded wars that small changes in 
tactics and strategy can lead to overwhelming changes in outcome. This recognition has sometimes led to inaction, because the 
outcome of any action is unpredictable. What is new about the networking concept is the attempt to formalize the interactions 
so as to make the response to change predictable. New methods, such as nonlinear dynamics theory and chaos, have enabled 
us to determine the boundaries of predictability in apparently random dynamic processes, so we can realistically assess what 
we can and cannot know about a particular network. But it all goes back to the individual Soldier, situational awareness, and 
leadership involving the synthesis of information in real time. 

The research group consisting of Hollingshad, Turalska, and Grigolini at the University of North Texas; Allegrini from the University 
of Pisa; and West from ARL collaborated on constructing, “A New Measure of Network Efficiency" (page 13). They address the 
dynamical origin of the scale-free probability distribution function (pdf) and, consequently, the related issue of network topology. 
Here again, the local interactions are found to determine the global properties of the network, using a nonlinear mathematical 
model with nearest neighbor interactions on a two-dimensional lattice. The network dynamics undergo a phase transition at 
a critical value of the control parameter, thereby inducing long-range correlations among the members of the network. The 
researchers determine that the distribution of links favoring long-range connections increases the efficiency of information 
transfer, which becomes optimally efficient when the IPL index for the degree pdf is one. They attribute this to the role of network 
leadership being transient, with the rank differences between the most-linked and least-linked nodes being temporary and over 
time the highest rank is shared by all members of the network. The full implications of this insight remain to be explored. 

A fundamental difficulty with the concept of networking networks, such as nature has done within the human body or as applied 
to material structures, is related to the multiple scale and multi-physics requirements, as seen, for example, in electronic 
networks. If all physics were known and we had the necessary mathematical tools, we could model the ultimate performance 
of any physical network given the atomic composition of the parts. Quantum physics would enable atomic and nano-scale 
properties to be captured and parameterized yielding the electronic, thermal, and mechanical attributes of materials. From this 
ability to effectively couple across scales, we could eventually determine the capabilities and performance of multifunctional 
networks interrelating sensor, information, and communication capabilities. We could do this without fabricating network 
components, and we could achieve optimization at a level of sophistication that is beyond our wildest dreams using current 
technology and network development. On the other hand, it seems unreasonable to model every atom and electron in a physical 
network, yet the atomic level properties affect the composite performance and, therefore, cannot be neglected. At criticality, 
the smallest fluctuations are often amplified to the macroscopic level and, therefore, contain information on the nature of the 
large-scale network. 

Ren and Zhao of the University of California, Davis have collaborated with Swami of ARL on, “Connectivity of Heterogeneous 
Wireless Networks" (page 53). This research results in a better understanding of the spatial connectivity of one heterogeneous 
network (secondary) embedded within another (primary); the heterogeneity is modeled using Poisson statistics for the ad hoc 
network. The analysis is done using percolation theory, which was initiated in statistical physics and has since developed into 
its own branch of mathematics. Percolation theory is a natural method for answering the question of whether the secondary 
network can be connected, i.e., when can a message be transmitted across the entire network without disruption? The 
researchers establish that when the density of the primary transmitters is beyond a certain level, there are simply not enough 
spectrum opportunities for any secondary network to be connected. The design parameters are the interference tolerances 
of the primary and secondary Poisson distributed networks and are used to design the optimal transmission power of the 
secondary network based on that of the primary. Consequently, the connectivity of large-scale ad hoc heterogeneous wireless 
networks are expressed in terms of the occurrence of the percolation phenomenon. It turns out that matching the interference 
ranges of the primary and secondary networks maximizes the tolerance of the secondary network to the primary traffic load. 
Since the tolerable interference range is an indicator of the receiver’s sophistication, this can be interpreted as an instance of 
the complexity matching principle discussed in, “Transmission of Information Between Complex Systems: l/f Resonance" (page 
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233), and “Temporal Complexity of the Order Parameter at the Phase Transition" (page 213 ). 

A systematic procedure based on multiple models at different scales with representations at adjacent scales linked to each 
other by defined parameterized interfaces that can be calibrated to measurement is one reasonable approach to multi-scale 
modeling. This is done today through a combination of deterministic and stochastic modeling, where the smaller scales give rise 
to statistical uncertainty at the larger scales. Thus multi-scale modeling involves multiple organizations and disciplines leading 
to multiple, seemingly incompatible approaches. One school of thought believes it essential that an object-oriented approach 
be followed, meaning in this context that there are well defined interfaces to models capturing a particular scale and that the 
various scales are captured, most likely, by stand-alone programs. This multi-scale modeling is a grand scientific/engineering 
challenge for Army research. 

Metz at the University of Ulm; Mitasova at North Carolina State University; and Harmon at ARL collaborated on, “Efficient 
Extraction of Drainage Networks from Massive, Radar-Based Elevation Models with Least Cost Path Search" (page 39). They 
use a least-cost drainage path (LCP) search methodology to significantly improve the accuracy of stream mapping, especially in 
remote, hard to reach regions and those that can be mapped with massive data sets. The LCP search methods were designed 
to find the shortest or fastest route from a starting point to a given destination. The success in extracting reliable drainage 
networks from massive radar-based data sets indicates that nature must optimize a cost function over geophysical time scales 
in direct analogy to the variation in total energy determining the dynamics of mechanical systems. Here the cost is interpreted 
in terms of local elevation and the least-cost path is minimal elevation contour. 

3.2 Life Sciences 

Complex networks in the life sciences generate patterns that have been summarized in a number of empirical relations. In 
physics we have Ohm’s law, the perfect gas law, and other empirical relations among the variables describing the physical 
process. These “laws" are, of course, phenomenological rules developed to summarize what has been observed experimentally 
and that were eventually derived using fundamental theory. In more complex living networks, equally simple rules have been 
uncovered that typically relate a particular function to the size of an organism, as first observed in biology some two hundred 
years ago. The first such expression related the mass of an animal’s brain to its total body mass, using mass as a measure of 
size and giving rise to the nomenclature of allometry relation (AR). Allometry, literally meaning by a different measure, has been 
defined as the study of body size and its consequences both within a given organism and between species in a given taxon. 

The history of ARs is presented by West at Renesselaer Polytechnique Institute and West at ARL, who critiqued the various 
attempts to construct the theory underlying the scaling observed in ARs in, “On Allometry Relations" (page 115). ARs are 
critically important because they set the boundary conditions that network models must satisfy in order to be compatible with 
existing data. For example, the fractal network nutrient transport model used to successfully derive the metabolic AR is one of 
the network applications reviewed. 

Another emerging application of network theory is in analyzing interactions and relationships in groups of animals and/or 
humans. The network science methodology allows for better understanding of the complexity of interactions and communications 
between individuals in groups. The review by Pollard, formerly at the University of California, Los Angeles and now with ARL, 
and Blumstein at ARL, “Evolving Communicative Complexity: Insights from Rodents and Beyond" (page 87), discusses the 
relationship between social interactions and communication patterns, and how focused studies of communicative complexity 
can better untangle the underlying evolutionary relationships. Thefunctional relationship between attributes of social complexity 
and attributes of communicative complexity allow for a better understanding of how socialization and communication evolve. 

The importance of interactions in neural networks for understanding of the operation of the brain has been recognized for a 
longtime. However, only recently have advances in theoretical, experimental, and computational approaches to the study of 
neural networks begun to reveal fundamental principles of brain structure and function. There is emerging consensus among 
neuroscientists that traditional approaches concentrating on identifying brain structures responsible for specific functions and 
behavior need to be expanded to include investigations of interactions between these structures. New brain imaging techniques 
allow for more precise identification of neural tracts connecting different regions of the brain, providing information necessary 
for creation of the structural connectome network of an individual brain. This allows for analyzing and interpreting characteristic 
features in different individuals as well as identifying abnormalities caused by neurotrauma. The article by Kraft, Mckee, and 
Dagro at ARL in collaboration with Grafton at the University of California, Santa Barbara, “Combining the Finite Element Method 
with Structural Connectome-Based Analysis for Modeling Neurotrauma: Connectome Neurotrauma Mechanics" (page 99), 
presents a novel approach to analysis and interpretation of structural connectome obtained from MRI images by constructing a 
finite element model (FEM) informed by experimental data at a cellular level. This approach integrates brain injury biomechanics 
and graph theoretical analysis of neuronal connections to form a computational model capturing spatiotemporal characteristics 


9 


of trauma. Time evolving simulations of the trauma investigated in this study show how brain network measures of global 
and local efficiency are degrading in time after trauma. This approach provides a basis for quantitative analysis tools for 
interpretation of neuroimaging data. Computational models based on theoretical analysis of neural network connections could 
help inform subsequent analysis of functional networks. 

Neural network models based on neurophysiological properties of neurons provide insight into functional relationships between 
different brain structures. In recent years, due to increased computational capabilities, it has become possible to model larger 
networks composed of more realistic neurons. This allows for more realistic simulations of normal and/or pathological brain 
function, which can be directly compared with experimental or even clinical measurements. The article by Anderson, Azhar, 
Kudela, and Bergey at Johns Hopkins University in collaboration with Franaszczuk at ARL, “Epileptic Seizures from Abnormal 
Networks: Why Some Seizures Defy Predictability" (page 73), illustrates how this modeling approach can provide some insights 
into observed but difficult to interpret phenomena within the human brain. Epileptic seizures are the pathological activity of 
the brain disrupting the normal activity of an afflicted subject. While a seizure usually lasts only for a relatively short period 
of time (1-2 minutes), its unpredictability causes significant disruption in the life of the subject and may even cause serious 
injury due to associated loss of consciousness and body control. There is limited opportunity to record data associated with 
epileptic seizures. Even during hospitalization, patients may have as few as one or two seizures during an entire week. The 
computationally realistic modeling allows for investigation of multiple simulated seizures with full control of critical parameters. 
In Anderson et al., the authors created a computational model of the cortex consisting of more than 60,000 spiking neurons 
of several types based on histological data. Analysis of simulations of neuronal activity in this model suggest that, in the region 
of the cortex, with abnormal connectivity analogous to seizure focus in the human brain, it is possible to initiate seizure activity 
with random fluctuations of input from the surrounding cortical regions. This mechanism helps explain the difficulty in predicting 
partial seizures in some patients and suggests that intervention not only in the seizure focus area, but also in surrounding 
areas, may be necessary to prevent seizures from occurring. 

The continuing research at ARL is directed toward combining the structural modeling of brain networks, “Combining the Finite 
Element Method with Structural Connectome-Based Analysis for Modeling Neurotrauma: Connectome Neurotrauma Mechanics" 
(page 99), with neurophysiological models similar to those presented in “Epileptic Seizures from Abnormal Networks: Why Some 
Seizures Defy Predictability" (page 73) to create a more complete model of brain neural networks. 

The diverse applications of network-based or inspired studies in life sciences presented here show that the network sciences 
concepts and methodology provide a useful conceptual platform for description and analysis of various phenomena and 
processes. However, we are in the early stages of applying NS concepts to the various life sciences. Development of more 
specific mathematical and computational tools is required, but the future applications are limitless. 

3.3 Social Sciences 

In tactical networks, distributed decision making should take into account trust in the elements: the sources of information, the 
processors of information, the elements of the communications network (both radio and other) across which the information 
is transmitted, etc. This trust must often be derived under time-critical conditions, and in a distributed way. It must account for 
the uncertain provenance of the data, including the fact that information sources and relays in a tactical network are subject 
to subversion. Trust is a critical element of networked interactions, including interactions between humans and agents, that 
may be direct or computer-mediated. Trust and reputation are becoming increasingly important due to the explosive growth 
in electronically mediated social interactions, often with sometimes correspondingly weakened social control. Trust is a multi¬ 
faceted concept and depends upon the context, the trustor, the environment, the infrastructure, and the mediating agents or 
relays. Trust is a critical element of modeling team, group, squad, and company behaviors. It must be taken into account in 
promoting group cohesiveness and effective problem-solving, and it must be adequately modeled in quantifying the tension and 
trade-off between individual utility and group welfare. Technology can enhance as well as weaken trust relationships. Currently 
used models of trust do not adequately capture critical elements of human trust (such as lack of transitivity, symmetry, and 
reciprocity, which calls for novel mathematical tools). Managing trust in such an environment is challenging. Cho and Swami 
from ARL collaborated with Chen from Virginia Tech on the paper, “A Survey of Trust Management in Mobile Ad Hoc Networks" 
(page 189), which addresses some of these challenges. It discusses the concepts and properties of trust pertinent to the 
constituent elements of a tactical network and proposes composite trust metrics that take into account the interaction between 
the constituent networks, the resource constraints, and the mission goals. This comprehensive survey discusses the differences 
between trust and trustworthiness, trust and risk, and trust management schemes. 

How can one maximize rewards in an unknown environment? Learning is clearly a crucial element here. There are two important 
performance criteria to assess the efficacy of learning: consistency and regret bounds. A learning algorithm is consistent if the 
learned estimates asymptotically converge (in some sense) to the true values. Regret is a measure of the speed of convergence, 
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the time-averaged loss in performance compared with that obtained by a genie. Anandkumar from the University of California, 
Irvine, and Michael and Tang from Cornell University collaborated with Swami from ARL on aspects of this problem in the paper, 
“Distributed Algorithms for Learning and Cognitive Medium Access with Logarithmic Regret" (page 173). The scenario they 
considered is related to the problem studied in “Connectivity of Heterogeneous Wireless Networks" (page 53). They consider the 
problem of so-called secondary users learning the unknown rewards (perhaps rates) obtained by accessing a set of channels. 
They consider a distributed framework where there is no information exchange or prior agreement among the secondary users. 
This introduces additional challenges: First, one can expect loss of performance due to collisions among users. Second, there 
is competition since all the secondary users are vying to access channels with the highest rewards. Using the theory of multi¬ 
armed bandits, they analyze the trade-off between exploration and exploitation and between cooperation and competition. They 
propose an order optimal distributed learning and access policy that minimizes the regret. An important notion here is that 
one must learn from mistakes (collisions and sensing that may be imperfect) and that one can never stop learning (one must 
continue to explore). 

Cooperative interactions can lead to phase transitions and the emergence of spatial and network complexity in many systems. 
The Ising model has been used to model phase transitions in a variety of phenomena, including ferromagnetism, biological 
and neurophysiological processes, the flocking of birds, econophysics, etc. But the temporal complexity of such networks had 
not been explored until recently. Turalska and Grigolini from the University of North Texas collaborated with West of ARL on 
the paper, “Temporal Complexity of the Order Parameter at the Phase Transition" (page 213), which explores the temporal 
complexity issue. They study a decision making model in which the time intervals between significant events has a power-law 
distribution, which they show is equivalent to a two-dimensional Ising model. The underlying process is a renewal process, 
but it is non-stationary and exhibits long-term correlations. Thus the fundamental property of ergodicity no longer holds, and 
new mathematical tools are required to analyze these non-Poisson renewal processes. The researchers establish that phase 
transition leads to temporal complexity as manifested by non-stationary and non-ergodic fluctuations. They argue that this 
is a basis for complexity matching, a notion elaborated on in, “Transmission of Information Between Complex Systems: 1/f 
Resonance" (page 233). The pair of papers, “Temporal Complexity of the Order Parameter at the Phase Transition" (page 213) 
and “Transmission of Information Between Complex Systems: l/f Resonance" (page 233) indicate that information can be 
transferred between networks via extremely weak coupling, provided that the complexity of the two networks is matched (in 
the sense that a form of resonance exists). The concepts of complexity management and complexity matching are likely to find 
wide applicability in NS. 

3.4 Information Sciences 

In the social and life sciences, network discovery has led to increased understanding of how the brain works and how 
information propagates in communities of people. In the information sciences, one wishes to create a physical network that 
enables communication between entities, either person-to-person, person-to-machine, or machine-to-machine. Design of the 
physical network must account for the manner and the environment in which the network will be used. 

In most non-military situations, the backbone of this network is fixed, for example, through cell towers. Senders and receivers 
are tethered to the backbone, that is, fixed spatially and hard-wired, or untethered, physically mobile, and wirelessly connected. 
However, in the environments in which the military operates, designers cannot expect a wireless infrastructure to be available for 
use. Thus, military units must be capable of establishing and maintaining their own network, which is a challenge during mobile 
operations. It is even more of a challenge when nodes on the network are small, autonomously mobile platforms operating on 
city streets and inside buildings, where floor plans and building materials affect the viability of links between nodes. 

The use of multiple autonomous platforms is inspired by bionetworks in nature such as insect swarms. Although each single 
entity in a swarm has limited intelligence, the swarm appears to behave with intent when the entities function in concert. 
Consider the behavior of social insects, such as bees and ants. Fish and birds are other examples of animals whose collective 
behavior belies their limited intelligence. Although this analogy is intellectually appealing, differences exist between bionetworks 
and networks of autonomous mobile platforms that make it difficult to implement. Bionetworks, for example, swarm for survival, 
either to seek food or provide protection. In our grandest vision, swarms of autonomous platforms will exhibit complex behavior, 
such as intelligence or sentience through “situational awareness." Further, communication within animal swarms occurs 
through proximity, e.g., ants touching antennae or following a chemical trail, line-of-sight vision, or sound. Put simply, biological 
swarms communicate simple messages primitively to perform primitive behaviors, whereas our vision of autonomous platforms 
requires complex messaging to achieve complex behaviors. 

Given the present state of technology, a better analogy may be a shepherd herding sheep using a sheep dog. That is, a single 
intelligent entity controls a layered hierarchical network wherein each layer contains more entities of lesser intelligence than 
the layer above it. Whether a swarm or a pack is the better analogy, in any case, researchers have only scratched the surface 
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of issues concerned with designing a mobile network created by a collective of autonomous platforms. Controlling the mobility 
of a pack of autonomous platforms and communicating within it remain unsolved problems. 

How platforms move to achieve their mission, for example, generating the floor plan of a building previously unexplored by US 
units, yet maintain connectivity is the focus of the paper, “Robust Control for Mobility and Wireless Communication in Cyber- 
Physical Systems with Application to Robot Teams," by ARL researcher Fink and colleagues Riberio and Kumar at the University 
of Pennsylvania. The key element of the paper, “Robust Control and Mobility and Wireless Communication in Cyber-Physical 
Systems With Application to Robot Teams" (page 247), is the authors’ statement that “successful deployment of an autonomous 
team of robots requires joint cyber-physical controllers that determine (physical) trajectories for the robots while ensuring 
(cyber) availability of communication resources." The authors’ approach to solving this problem is to consider communication 
parameters (bandwidth, attenuation, noise) as virtual obstacles that restrict platform movement. However, in an iterative 
control cycle, after platforms move, the network readjusts communications parameters to provide reliable communication (e.g., 
establishes new links, removes old ones, increases transmitted power, or reduces transmission bandwidth). 

The basis for control is minimizing a potential function subject to the constraint that data transmission in every established link 
is above some threshold data rate. The strength of this approach is its global perspective of all links. Previous methods consider 
mobility from the perspective of local links only. Simulations and experiments with five mobile ground platforms validate the 
viability of this approach. 

Analogies to nature inform the design of these experiments. In addition, they provide a benchmark against which one can 
measure performance. However, the experiments provide a perspective for reinterpreting and understanding nature. Does 
the collection of individual movement and communication minimize a global potential? If so, what potential? The total energy 
expended by the network to achieve goal? How does the mode of communication impact an individual’s movement if the 
communication is non-line-of-sight versus line-of-sight? The answers to these questions fall back into the realm of network 
science theory. 

Designers of the physical networks that enable the transfer of electronic information need to consider means for simple, rapid, 
and robust data transfer. In applications where the network is stable and network operators have considerable opportunity to 
monitor and control the network, for example, commercial networks, message traffic can be handled sequentially. However, in 
complex, less stable, environments, parallel transmission of messages provides more robust operation. In this case, messages 
between two channels are transmitted contemporaneously to a relay. The relay combines the messages and retransmits the 
same composite message to both channels. Since each channel knows the signal they transmitted, it is a simple matter for 
them to extract the unknown message from the other channel. 

However, parallel messaging dictates that messages between channels must arrive within certain time blocks, which requires 
timing control to minimize transmission delays. As the likelihood of large delays increases, the likelihood of missed messages 
also increases. Although coherent detection, in which all channels operate on the same time base, minimizes timing delays, 
it requires precise control of phase. Such control is unlikely in unstable environments. In these situations, designers trade the 
precision of coherent detection for the simplicity of non-coherent detection, which does not require a frequency or phase base. 
In their implementation of a non-coherent system, Valenti and Ferrett of West Virginia University and Torrieri of ARL, “Non¬ 
coherent Physical-Layer Network Coding with FSL Modulation: Relay Receiver Design Issues" (page 221), use frequency shift 
keying (FSK) to encode signals. In FSK, the network needs to detect only changes in frequency, not measure absolute frequency, 
to demodulate signals. Because the encoding does not require a reference frequency, it reduces network requirements on 
frequency control and on power control. 

In this paper, Valenti, Torrieri and Ferrett present the theory, design, and simulated operation of FSK for non-coherent physical- 
layer network coding. They show that the encoding provides high throughput with low energy efficiency, which is typical of non¬ 
coherent systems. To overcome power losses, they employ low rate turbo codes. The authors show that, for the same energy 
efficiency, their digital network coding (DNC) approach achieves a higher throughput rate in comparison to more traditional 
link-layer network coding (LNC). In one example, the authors show a one-third rate increase. That is, four messages can be sent 
using the authors’ approach for every three messages transmitted using LNC. 

Aquino of the Imperial College London, Bologna of the Instituo de Alta Investigacion Chile, Grigolini of the University of North 
Texas, and West of ARL investigated the transport of information between two complex networks to determine the conditions 
under which the information transport is maximized in, “Transmission of Information Between Complex Systems: l/f Resonance" 
(page 233). If network theory is ever to become Network Science, there must exist general principles that transcend the level of 
disciplinary mechanisms and that remain valid across the physical, biological, and social phenomena described above. In this 
paper, Aquino etal. use arguments from non-equilibrium statistical physics to make the case that if the complexity of a network 
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can be quantified, then the information transferred is maximized through the matching of the complexity of the interacting 
networks. 

The complexity of networks, from avalanches of neuronal firings within the human brain to the blinking times in colloidal 
quantum dot fluorescence, is manifest in IPL distributions. The level of complexity of these networks is quantified by the value 
of the IPL index. Of particular importance is the manner in which information is shuttled back and forth between such networks 
and whether or not there exists a general principle that guides the flow of information. The mathematician Norbert Wiener 
speculated thatsuch a principle existed for information-dominated processes. A half century later, Aquino etal. in, “Transmission 
of Information Between Complex Systems: l/f Resonance" (page 233), proved Wiener’s conjecture to be true by establishing 
the Principle of Complexity Management in which information transport is mitigated by the relative values of the networks’ IPL 
indices. They establish the principle by generalizing the fluctuation-dissipation theorem from non-equilibrium statistical physics 
to non-ergodic processes. In this way, it is possible for a network with lesser energy, but greater information, to organize a 
network with greater energy, but lesser information; this is an information-dominated process. 

On one hand, a speaker who delivers a lecture standing motionless in front of an audience, talking in a monotone, transfers 
little or no information. The group very quickly drifts off and stops listening almost independently of the content of the lecture. 
On the other hand, another speaker with a modulated voice, striding back and forth across the front the room while gesturing 
dramatically, can hold the audience’s attention. The complexity of the presentation matches that of the typical cognitive activity 
of the brain and information finds its way into the consciousness of the audience member. The success of the second speaker, 
as well as a myriad of other social and psychological phenomena, are explained by the Principle of Complexity Management and 
the relative complexity of the two interacting networks. 

The following collection of recently published journal articles is a select sample of the broad spectrum of ARL’s network science 
research investigating physical, social/cognitive, information, and communication networks, and their complex interactions. 
They represent foundational knowledge that will enable optimized human performance and enhanced speed and precision in 
complex networked environments. 
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We address the issue of the dynamical origin of scale-free link distributions. We study 
a two-dimensional lattice of cooperatively interacting units. Although the units interact 
only with the four nearest neighbors, a sufficiently large cooperation strength generates 
dynamically a scale-free network with the power law index v approaching 1. We explain 
this result by using a new definition of network efficiency determined by the Euclidean 
distance between correlated units. According to this definition the link distribution 
favoring long-range connections makes efficiency increase. We embed an ad hoc scale-free 
network with power index v > 1 into a Euclidean two-dimensional space and show that 
the network efficiency becomes maximal as v approaches 1. We therefore conclude that 
v = 1 emerging from the cooperative interaction of units may be a consequence of the 
principle of network maximal efficiency. 

© 2011 Elsevier B.V. All rights reserved. 


1. Introduction 

The work of Albert and Barabasi (AB) [1] prompted interest in scale-free networks, namely networks with the inverse 
power law distribution of links 

P(k ) oc (1) 

with k denoting the number of links per node. The AB model assumed the preferential attachment of the Yule process and 
gave rise to the power-law index v = 3. The ubiquity of these networks has led to the research into the dynamical origin 
of the scale-free property. Fraiman et al. [2] has recently studied the two-dimensional Ising model at criticality and using 
a suitable threshold p + established a link between the nodes with a dynamical correlation exceeding that threshold. The 
interesting result of that procedure was the emergence of the scale-free distribution density of Eq. (1) with a power index 
v that, with the proper choice of average connectivity (/<), gives values close to v = 2 [3]. It is remarkable that the same 
procedure, applied to the brain, yields results very close to those afforded by the Ising model [2]. 

Like the work of Ref. [2], we begin with a regular two-dimensional network, but then take an entirely different approach. 
Rather than using the Ising model, we adopt the Decision Making (DM) model of Refs. [4,5] for the creation of the dynamically 
induced network. The key result is that while we also find the emergence of a scale-free network at criticality, our network 
has an exponent v close to 1. 
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This finding suggests that the cooperative interaction between the units in the network generated from the DM model 
should make the network very efficient. However, we find herein that the conventional measures of network efficiency [6] 
are relatively insensitive to v < 2. The second result of this paper is a new measure of network efficiency, the perception 
length, that overcomes this limitation, and shows that network efficiency increases as v approaches 1. 

The outline of this paper is as follows. We devote Section 2 to proving the dynamical emergence of the scale-free network 
with v = 1. Section 3 shows that v approaching 1 maximizes the perception length while realizing a statistical condition 
equivalent to the dynamically generated network. We devote Section 4 to concluding remarks. 


2. Dynamically induced scale-free network with v = 1 


The question of whether the network emerging from dynamical correlations is scale-free is not trivial. For this paper, we 
adopt the Decision Making (DM) model of Refs. [4,5]. Although the DM model yields a phase transition very similar to that 
of the conventional Ising model [7], the DM model does not have a Hamiltonian foundation. Furthermore, while the Ising 
model rests on the action of a thermal bath at a finite temperature, and the single spins in isolation and with no thermal 
driving would be dynamically frozen, the units of the DM model, in isolation, are driven by Poisson dynamics. This difference 
may lead to a dynamically created scale-free network different from that of Ref. [2]. 

The DM model considers N discrete variables located at the nodes of a two-dimensional square lattice. Each unit s, is a 
stochastic oscillator and can be found in either of two states, “yes” (+1) or “no” ( — 1). The cooperation among the units is 
realized by setting the transition rates between two states to the time-dependent form: 


gn(t) =ge K 


M 2 (t)-M 1 (t) 

M 



and 
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where M denotes the total number of nearest neighbors, which in the case of a two-dimensional lattice results in M = 4. 
M-i(t) and M 2 (t) are the nearest neighbors who are making the decision “yes” and “no”, respectively. The single unit in 
isolation, I< = 0, fluctuates between “yes” and “no”, with the rate g. When coupling constant I< > 0, a unit in the state “yes” 
(“no”) makes a transition to the state “no” (“yes”) faster or slower according to whether M 2 > Mi (Mi > M 2 ) or M 2 < Mi 
(Mi < M 2 ), respectively. The quantity I( c is the critical value of the control parameter K, at which point a phase-transition 
to a global majority state occurs. It can be shown that for a lattice of infinite size I< c = 2 ln(l + *J2) ^ 1.7627. 

All simulations are implemented on a lattice of N = 100 x 100 nodes with periodic boundary conditions. In a single time 
step, a run over the entire lattice is performed and for every node an appropriate transition rate (g 12 org 2 i) is calculated 
according to which a node is given possibility to change its state. Under those conditions, setting parameter g = 0.01, the 
phase transition to the global majority case occurs at I< c ~ 1.70. To study the dynamically induced network topology, we 
considered the DM model with this critical value of coupling constant. After an initial 10 6 time steps, we record 2000 lattice 
configurations, obtaining the dynamics of each node {s/(t)}. In the next step, we evaluate the linear correlation coefficient 
between the i-th and the j-th node: 


r ~ {Si(t)Sj(t)) - {Si(t)){Sj(t)) 

r(i,J ) = , - , (4) 

y /()> - (S/(t)) 2 y/(S J 2 (t)) - ( Sj(t )) 2 

where (■ ■ •) stands for the time average. Nodes i and j of the dynamically induced network are connected by a link when the 
correlation between nodes i and j of the two-dimensional lattice is greater than a given positive threshold p + . 

We investigated a wide range of positive thresholds p + = [0.10 : 0.90]. As expected, correlation networks constructed 
with increasing value of the threshold p + are characterized by a decreasing number of links, which results in significant 
changes in the corresponding degree distribution. Networks obtained with low values of the threshold p + result in narrow, 
peaked degree distributions and relatively high values of the mean degree (/<). Topologies resulting from very strong 
correlations, however, are characterized by a distribution of links that has a mean close to the minimum value of (k) = 1 
and drops off very rapidly. Interestingly, in between those conditions there exists a correlation threshold that creates a 
dynamically induced network with a power-law distribution of degrees that is shown in Fig. 1. The connectivity of a network 
obtained under this particular condition is inspected and only nodes that are part of a giant component are considered to be 
a part of new topology. The resulting network consists of 6740 nodes with a mean degree of (k) = 38. For a correlation 
threshold p + = 0.61, we find that the distribution of links follows an inverse power law with the scaling parameter 
v = 1.20 ±0.23 ( R 2 = 0.9556). 

How can the dynamical origin of this extremal condition be explained? From an intuitive point of view there exists a 
connection between the scale-free distribution of links and Zipfs law [8,9]. In fact, if we rank the nodes moving from the 
richest to the poorest, according to Zipf [10] we should obtain the relation between the connectivity k and rank r: 
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Fig. 1. Degree distribution p(/<) for correlation network obtained with a positive threshold p + = 0.61. The line follows estimated power-law relation with 
the exponent of v = 1.20. 

with the power-law index a being very close to 1. If we select randomly a given rank r with probability 7i(r)dr of being in 
the interval [r, r + dr], this should be equal to the probability p(k)dk, of having k links in the interval [k, k + d/<], 

p(k)dk = 7r(r)dr. (6) 

Assuming that jr(r) is constant, we obtain after some algebra 
1 

p(k) oc — r , (7) 

/<’+« 

and thus the two power-law indices are related by 

0 = 1 + —. ( 8 ) 

a 

This leads us to conclude how an efficient complex network is organized if Zipfs law applies. For example, this suggests 
that the intelligence of human brain should be characterized by v = 2. The recent research work of Fraiman et al. [2] seems 
to confirm this prediction. The authors of Ref. [2] studied the two-dimensional Ising model at criticality and using a suitable 
threshold p + established a link between the nodes with a dynamical correlation exceeding that threshold. The interesting 
result of that procedure was the emergence of the scale-free distribution density of Eq. (1) with a power index v that, with 
the proper choice of average connectivity (/<), gives values close to v = 2 [3]. It is remarkable that the same procedure, 
applied to the brain, yields results very close to those afforded by the Ising model [2]. 

It is also interesting to notice that Boettcher and Percus [11] applied the method of extremal optimization to a Ising- 
like model to evaluate the rank distribution of Eq. (5) and found that when the number of units tend to infinity the rank 
distribution tends to a = 1. This result implies on the basis of the earlier remarks that the cooperation of units in an Ising- 
like model is expected to confirm ZipFs prediction and consequently v = 2. For results confirming this prediction, see also 
Refs. [12,13]. 

We have found, therefore, that the DM model generates a scale-free network at criticality, but with v approaching 1 
which violates Zipfs condition of a = 1. The emergence of a 1 corresponds to the number of links k dropping very 
quickly when we move from r = 1 to subsequent ranks. 

3. Embedding scale-free networks in a two-dimensional regular lattice 

The efficiency of a network has been mainly studied on the basis of topological length L [6]. The topological distance 
between two nodes is the minimum number of steps needed to move from one node to another. The quantity L is the 
corresponding mean value of the minimum topological distance between nodes. As a consequence, the emergence of scale- 
free networks can be explained by making the conjecture that real networks evolve in time so as to realize maximal 
efficiency. In the case of the scale-free network this efficiency is measured by the minimal value of L. 

In 2003, Cohen and Havlin [14] determined that scale-free networks are very efficient. They calculated that networks 
with N nodes have a topological length L given by 

L ~ ln(lnN), (9) 

for a power-law index v < 3, which is smaller than in the AB model. 

As a consequence of Eq. (9), the topological length L remains small even if N is very large. This relation between L and N 
suggests that the complex networks with v < 3 are very efficient. 

The region v < 2 has not been studied by many authors. As an interesting example of earlier work in this region we 
refer to the work of Refs. [15,16]. In both cases the authors adopt a kind of generalization of the perspective of Albert and 
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Fig. 2. Perception length P as a function of v for ad hoc scale-free networks. The circles represent networks with constant (k) = 4.95; the squares represent 
networks with constant N = 1313. 


Barabasi [1], namely, they grow a network with special prescriptions allowing them to overcome the limiting condition 
v > 3 and yielding v <2. 

More recently there has been some interest in the efficiency of hierarchical networks with v < 2 [17] and v ~ 1 [18,19]. 
These networks are characterized by very small topological lengths. In the case of Ref. [19], for N -> oo the authors predict 
the value L = 2. Our numerical calculations show that L is insensitive to v in the range 1 < v < 2. Thus, there are good 
reasons to believe that the prediction of Eq. (9) applies with no significant dependence on v. 

There are also good reasons to believe that the dynamic source of scale-free networks has a probabilistic rather than 
a deterministic origin, as we shall demonstrate. The work of Fraiman et al. [2], although fitting ZipFs condition, allows us 
to define the concept of perception length, which is then used to shed light onto the condition v = 1. The nodes of a 
two-dimensional regular lattice, where each unit interacts with only its four nearest neighbors, correspond to a network 
whose topological efficiency is very low. However, if we let the units cooperatively interact, and we set a link between two 
nodes, then when their dynamical correlation is large enough, the resulting dynamical network may have a much lower 
L. We record the Euclidean distance between each pair of correlated nodes, and we define the network efficiency as the 
corresponding mean value. More precisely, we define the network efficiency by means of the quantity 



where 



( 10 ) 


( 11 ) 


and dij is the Euclidean distance between node i and its nearest neighbor j. We refer to P as the perception length of the 
system, this being the measure of the network efficiency replacing 1 /L. For numerical analysis, the perception length P is 
calculated as follows. 

1. Create a 2-dimensional square lattice, where a is the length (in units of the lattice) of the side of the square, and it is 
necessary that < a. 

2. Embed the network on the lattice by randomly assigning the network nodes to lattice points. 

3. Using periodic boundary conditions for the lattice, calculate the Euclidean length of each link. 

4. These lengths are then used to calculate P in accordance with Eq. (10). 


This procedure was then applied to scale-free networks generated according to the method of Catanzaro et al. [20] with 
(k) = 4.95 and values of v ranging from 1 to 2. The nodes were embedded randomly in a square lattice of size just sufficient 
to contain the network with the largest number of nodes. The perception lengths were calculated, and the results show that 
over this range the perception length increases as v approaches 1 (Fig. 2). 

We now compare the sensitivity of the conventional network efficiency (defined as the reciprocal of the network length 
1/L) and the perception length P to the variation in v. These results are shown in Fig. 3. Note that the perception length P and 
1 /L have been normalized to allow the presentation in a convenient scale. These results show that the perception length, as 
a measure of network efficiency, is more sensitive to changes in v than the conventional 1 /L. 

Applying this approach to the DM network discussed in Section 2 above yields a mean perception length = 1382. To 
provide a basis for comparison, we calculated the perception length for a comparably sized ad hoc scale-free network, also 
prepared in accordance with the algorithm in Ref. [20]. The results are compared in Table 1, which shows a very good 
agreement. 
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Fig. 3. Network efficiency as a function of v. In both figures, the circles represent the normalized perception length, and the squares represent the 
normalized 1 /L Error bars are not shown; standard deviations for the normalized perception lengths were very small (of the order of 0.005). 


Table 1 

Comparison of perception lengths for the DM network and a comparable 
ad hoc scale-free network. 



DM network 

Ad Hoc network 

Degree distribution exponent, v 

1.20 

1.19 

Number of nodes, N 

6740 

6744 

Number of links, k 

128,448 

128,413 

Perception length, P 

1382 

1373 


4. Concluding remarks 

The main result of this paper is the proof that the perception length P defined by Eq. (10) is more sensitive to v in the range 
1 < v < 2, and may therefore be a more useful measure of network efficiency than the topological length L [6]. This new 
definition is based on the assumption that the scale-free networks, even those derived using deterministic and hierarchical 
arguments [17-19], are compatible with the dynamical derivation. The apparent conflict between the two kinds of network 
is explained by noticing that the correlation between different nodes evaluated according to the numerical prescription 
of Section 2 rests on time windows of finite size and moving these windows to different time regions generate different 
networks with the same hierarchical topology. An alternative but probably more attractive way to explain this is that the 
leadership moves in time from some nodes to others. The nodes of the two-dimensional regular lattice that we used for the 
dynamical derivation of the free-scale distribution of Eq. (1) are totally equivalent, and the rank differences between the 
richest and the poorest nodes are temporary and in the long time run the same high rank is shared by all the nodes. 

There are some questionable aspects of the dynamical derivation of v = 1, realized in Section 2. First of all this result 
depends on an arbitrarily selected threshold p + . In addition, the power index v derived from the data in Fig. 1 is affected 
by the large fluctuations of the distribution density p(k) for large values of k, thereby explaining why we derive v = 1 
rather than v = 2, as in the work of Ref. [3]. However, on one hand, this observation led us to the discovery of the new 
measure of network efficiency of Eq. (10) and, on the other hand, as already mentioned in Section 1, the DM model is not 
quite equivalent to the Ising model. 

Furthermore, the interesting fact that a = 1 (Zipfs law) is related to v = 2 is not totally convincing. The authors of 
Ref. [9] established a connection between topological and dynamical properties by running a random walker on a scale-free 
network with the power index v. They found that the waiting time distribution density for returns of the walker to a given 
node are described by an inverse power law with power index /x = 3 — v. Using a generalized central limit theorem, they also 
proved that the same index pc also describes the inverse power law form of the probability density of relative frequencies 
p(f ) ~ / _/ \ so that, in analogy with Eq. (8), a rank-frequency law/ ~ r~ a yields /x = 1 + 1 /or, hence a = 1/(2 — v), 
namely a = 1 for v = 1. It is interesting to note that v = 1 yields /x = 2, and that the value /x ~ 2 emerges from the recent 
neurophysiological work of Refs. [21,22]. 

Thus, in addition to the new efficiency measure of Eq. (10), which is the unquestionable result of this paper, we are also 
led to make the plausible conjecture that the DM model may be an adequate tool to study the emergence of intelligence 
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in the sense pointed out by Couzin [23] and Cavagna et al. [24]. According to these authors the flocks of birds, as well as 
the brain [25,26], are systems at criticality, with a correlation length as large as the size of these systems. We are also led 
to make the conjecture that the dynamical generation of a scale-free network with v = 1 may be the manifestation of the 
principle of network maximal efficiency: a network of cooperatively interacting units evolves naturally so as to maximize 
P, the new measure of network efficiency. 
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Abstract —Consensus algorithms have generated a lot of interest 
due to their ability to compute globally relevant statistics by only 
exploiting local communications among sensors. However, when 
implemented over wireless sensor networks, the inherent iterative 
nature of consensus algorithms may cause a large energy con¬ 
sumption. Hence, to make consensus algorithms really appealing 
in sensor networks, it is necessary to minimize the energy neces¬ 
sary to reach a consensus, within a given accuracy. We propose a 
method to optimize the network topology and the power allocation 
over each active link in order to minimize the energy consumption. 
We consider two network models: a deterministic model, where 
the nodes are located arbitrarily but their positions are known, 
and a random model, where the network topology is modeled as 
a random geometric graph (RGG). In the first case, we show how 
to convert the topology optimization problem, which is inherently 
combinatorial, into a parametric convex problem, solvable with 
efficient algorithms. In the second case, we optimize the power 
transmitted by each node, exploiting the asymptotic distributions 
of the eigenvalues of the adjacency matrix of an RGG. We further 
show that the optimal power can be found as the solution of a 
convex problem. The theoretical findings are corroborated with 
extensive simulation results. 

Index Terms —Consensus algorithms, minimum energy con¬ 
sumption, random graphs, sensor networks, topology control. 


I. Introduction 

A VERAGE consensus algorithms have received consider¬ 
able attention in recent years because of their ability to 
enable globally optimal decisions using only local exchange of 
information among nearby nodes [ 1]—[3]. The price paid for 
this simplicity and the underlying decentralized philosophy is 
that consensus algorithms are inherently iterative. As a con¬ 
sequence, the implementation of consensus algorithms over a 
wireless sensor network (WSN) requires an iterated exchange 
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of data among the nodes, which might cause an excessive en¬ 
ergy consumption. This must be contrasted with a centralized 
strategy where there is a sink node that, after collecting all the 
observations from the sensors (perhaps over multiple hops), is 
virtually able to compute the desired statistic in a single shot. 
Hence, to make consensus algorithms practically appealing in 
a sensor network context, it is necessary to minimize the en¬ 
ergy consumption necessary to reach consensus. Clearly, the 
network topology plays a fundamental role in determining the 
convergence rate [4]. It is well known that, as the network con¬ 
nectivity increases, so does the rate of convergence. However, 
having a densely connected network requires a high power con¬ 
sumption to guarantee reliable direct links between many nodes. 
In principle, having a fully connected network is equivalent to 
having as many sink nodes as sensors, so that the convergence 
time of fully connected networks is minimum. However, the 
power consumption necessary to maintain a fully connected net¬ 
work is also maximum. On the other hand, a minimally con¬ 
nected network entails low power consumption to maintain a 
few links, but, at the same time, it requires a large convergence 
time. Since what really matters in a WSN is the overall power 
spent to achieve consensus, this paper addresses the problem 
of finding the optimal network topology that minimizes this 
overall power consumption, taking into account convergence 
time and transmit power necessary to establish reliable links 
jointly. The search for the optimal topology is, per se , a combi¬ 
natorial problem whose solution becomes prohibitive even for 
small scale networks. Nevertheless, we will present a relaxation 
technique that converts the problem into a convex problem, with 
minimum performance loss. 

If the links among the nodes are symmetric, or, equivalently, 
if the graph describing the network topology is undirected, the 
convergence rate can be lower bounded through the so called 
algebraic connectivity , defined as the second smallest eigen¬ 
value of the graph Laplacian [5]. For this reason, there have 
been works aimed at maximizing the algebraic connectivity of 
a given undirected graph by a suitable choice of the weights as¬ 
sociated to each edge [6], [7]. Alternatively, in [8] it was shown 
how some network topologies, such as small world graphs, can 
greatly increase the convergence rate. In [9], the authors show 
that nonbipartite Ramanujan graphs constitute a class of topolo¬ 
gies maximizing the convergence rate. In [10] it was shown how 
to add edges from a given set to a graph in order to maximize its 
algebraic connectivity. Other works, for example [11], proposed 
strategies to improve the convergence rate of gossip algorithms 
through geographic routing. 
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However, in all these works, the focus is always on conver¬ 
gence time and there is no cost associated with the establish¬ 
ment of the graph topology. Conversely, in our work, since the 
graph represents a real network, we consider as the optimiza¬ 
tion metric the power consumption necessary to maintain re¬ 
liable communication links among the nodes, taking into ac¬ 
count the radio propagation model, multiplied by the number of 
iterations necessary to achieve consensus. From this perspec¬ 
tive, enforcing a small world, scale-free, or Ramanujan graph 
topology may not be the best choice for a wireless network, 
whose topology should depend on the application at hand as 
well as on the propagation model. 

In the wireless communication network context, there have 
been works on optimizing the network topology in order to min¬ 
imize the power consumption necessary to guarantee connec¬ 
tivity, e.g., [12], [13]. These works concentrate on the communi¬ 
cation task and do not make any specific reference to the running 
application. However, it is now well established, in the sensor 
network context, that, whenever possible, an efficient wireless 
sensor network design should take into account the specific goal 
of the sensor network [14]. For this reason, we focus on the 
achievement of consensus in a WSN. It is well known that one 
of the most crucial parameters in WSNs is energy consumption, 
because in many contexts it is hard to recharge the batteries or 
scavenge energy from the environment. For this reason, in most 
applications, minimizing energy consumption is more appro¬ 
priate than minimizing convergence time (although, there are 
important applications where the latter could be more impor¬ 
tant). In [15], the minimum energy consumption problem was 
studied, assuming a common transmit power. As shown in [15], 
there typically is an optimum power that minimizes the energy 
necessary to achieve consensus within a prescribed accuracy. 

In this work, we generalize the initial idea suggested in 
[15] and we propose a method for optimizing the network 
topology and the power allocation across every link in order 
to minimize the energy necessary to achieve consensus. We 
consider two classes of networks: a) deterministic topologies, 
with arbitrary, but known, node locations, and b) random 
geometries, with unknown node locations, modeled as random 
variables. In the deterministic case, we optimize both topology 
and power allocation. Differently from [6], we do not assume 
any prior topology, as the topology comes out as a result of the 
optimization. Topology optimization is, in general, a combina¬ 
torial problem and hence an NP-hard problem (recall that an 
undirected graph composed of n nodes may assume 2 n ( n_1 )/ 2 
topologies). To tackle this issue, we propose a relaxation tech¬ 
nique that allows us to formulate topology optimization as a 
convex parametric problem. Then, we show that the effect of 
this relaxation on the performance is negligible. 

In the random topology case, where the internode distances 
are unknown, we show how to optimize the single (common) 
transmit power, modeling the network topology as a random 
geometric graph, a model suitable for wireless networks. We 
provide both theoretical and simulation results, exploiting the 
theory of the eigenvalues of random geometric graphs. 

The paper is organized as follows. In Section II, we briefly re¬ 
view the consensus algorithm. In Section III, we introduce our 
communication model and formulate the optimization problem. 


Section IV is devoted to topology optimization for arbitrary 
networks. In Section V, we start by providing a closed form 
expression, albeit approximate, for the algebraic connectivity 
of a random geometric graph. Then we use this expression to 
proceed with the topology optimization for random geometric 
graphs. The analytical findings are corroborated with extensive 
simulation results. 


II. Brief Review of Consensus Algorithms 


Let us consider a wireless network composed of n sensors. 
The network topology can be represented as an undirected graph 
G = {V, E\ where V denotes the set of n vertices (nodes) 
Vi and E C V x V is the set of bidirectional edges (links) 
Cij = eji connecting Vi and Vj. Furthermore, let A be the 
n x n-dimensional symmetric adjacency matrix of the graph G , 
with elements a,i 3 = 1 if £ E and = 0 otherwise. Ac¬ 
cording to this notation and assuming no self-loops, i.e., an = 0, 
Vz = 1,..., n, the degree of node Vi is defined as deg(z^) = 

n n 

S a ij — Y a ji • The degree matrix D is defined as the di- 
j =i j =i 

agonal matrix whose diagonal entry is da = deg(^). Let A fi 
denote the set of neighbors of node z, so that |A*| = deg(z; ? ;) 1 . 
The Laplacian matrix of the graph G is the n x n symmetric 
matrix L := D — A, whose entries are 


Gj — — 


deg(?/;) if j = i 



Given a set of measurements Xi( 0), collected by node z, for 
z = 1,..., n, the goal of the consensus algorithm is to allow 
every node to compute a globally optimal function of the mea¬ 
surements, say , through a decentralized mechanism that does 
not require the presence of a sink node, but builds on the interac¬ 
tion among nearby nodes only. Let us consider, for simplicity, 
the case where x* is the average of the measurements. In this 
case, reaching an average consensus can be seen as the mini¬ 
mization of the disagreement between the states Xi of the inter¬ 
acting nodes. One of the nice properties of the Laplacian is that 
the disagreement can be expressed as a quadratic form built on 
the Laplacian [16] 

1 n 

J ( x ) := 4II E( Xi ~ X E 

i =1 jeAfi 



i =1 j =1 


1 

2 


x Lx. 



An important property of the Laplacian is that it has, by con¬ 
struction, a zero eigenvalue, whose multiplicity is equal to the 
number of connected components of a graph. Hence, a graph is 
connected if its zero eigenvalue has multiplicity one [5]. Fur¬ 
thermore, if the graph is connected, the eigenvector associated 
with the null eigenvalue is the vector 1, composed of all ones. 

The minimization of the quadratic form in (2) can be achieved 
using a simple steepest descent technique. In continuous time, 
the minimum of (2) can be reached through the following up¬ 
dating rule [2]: 

x(t) = —Lx(t) (3) 

'By | • | we denote the cardinality of the set. 
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initialized with x(0) := xo 2 . By construction, the eigenvalues 
of L are all nonnegative. Hence, the convergence of (3) is guar¬ 
anteed. In particular, if the graph is connected, the state vector 
x(t) in (3) converges to the projection of the initial state xq onto 
the nullspace of L, spanned by the vector 1, i.e., [2] 

\ 

x(t) —> — 11 t xq . (4) 

t—»• oo n 


This corresponds to having all the nodes converging 
to a consensus on the average of the initial observa¬ 
tions x* = Y^i=i x i(®)/ n ' Moreover, the convergence 

rate is lower bounded by the slowest decaying mode, 
i.e., by the second smallest eigenvalue of L, A 2 (L), 
also known as the algebraic connectivity of the graph. 
More specifically, if the graph is connected, the dynamic 
system (3) converges to consensus exponentially [2], i.e., 
|| x(t) — x*l\\ < ||x(0) — x*l\\0(e~ rt ), with r = X 2 (L). 
Defining the disagreement vector as A x(t) = x(t) — x*l, we 
can write ||Ax(t)|| < ||Aa?(0)||O(e -rt ). As a consequence, 
the convergence time T c can be defined (see also [2]) as the 
time necessary for the slowest mode of the dynamical system 
(3) to be reduced by a factor 7 « 1, i.e., the time for which 
||Aa;(T c )|| < 7 ||Ax( 0 )||. Hence, we can set 


jogCr) 

A i(L) ' 



III. Optimization Criterion 


It is well known that one of the most critical issues in WSN is 
energy consumption. Neglecting for simplicity the power spent 
for processing operations with respect to the power to be used 
to enable wireless communications, the overall power spent to 
reach consensus is the product between the sum of the power 
Ptot necessary to establish the communication links among the 
nodes and the number of iterations A r i t necessary to achieve 
consensus. The exchange of information among the nodes is 
supposed to take place in the presence of a slotted system, with a 
medium access control (MAC) mechanism that prevents packet 
collisions. Denoting by T s the duration of a time slot unit, the 
number of iterations is then approximately 3 N[ t = T c /T s . 

Introducing the power coefficients pp, i / j, denoting the 
power used by node i to transmit to node j with = pp, and 
the binary coefficients op assessing the presence (a ? y = 1) of a 
link between nodes i and j or not (op = 0), the power spent by 
the whole network in each iteration is then P tot = JA ■ dijPij- 
Using (5), our goal is to minimize the following performance 
metric: 


s = PtotVt = K 


E ri ■sr^, ri 

i=1 Z^j =1 a ijPij 

WW) 


( 6 ) 


2 The discrete-time counterpart of (3) is [2] 


x[k + 1] = x[k] — eLx[k] W;c[fe] 


where t is a parameter chosen so as to ensure that no eigenvalue of W is greater 
than one in modulus. 

3 We neglect rounding errors, which tend to vanish if the duration of the up¬ 
dating time slot is small with respect to the convergence time. 


where K incorporates all irrelevant constants. This metric is 
proportional to the integral of power consumption with respect 
to time and then it represents an energy consumption. In (6), 
we have made explicit the dependence of the Laplacian L on 
the vector a = A{\) containing all binary coefficients op , since 
finding these coefficients is the goal of our optimization. More 
specifically, our goal is to find the set of active links, i.e., the 
nonzero coefficients op , and the powers pp that minimize (6), 
under the constraint of maintaining network connectivity, which 
entails A 2 (L(a)) > 0. The problem can then be formulated as 
follows: 


min 

a,p 

S.t. 


E n v—' n 

i =1 Z^7 = l a tjPij 


A 2 (L(a)) 

e < A 2 (L(a)) [P.0] 

CLij E {0,1} 

Pij >0 V i, j = l,...,n 



where e is an arbitrarily small positive constant used to prevent 
the algebraic connectivity from going to zero, which would cor¬ 
respond to a disconnected network and p is the vector with en¬ 
tries pp . 

Since the topology coefficients are binary variables, [P.0] is 
a combinatorial problem, with complexity increasing with the 
size n of the network as 2 n ( n_1 )/ 2 . Hence, its solution, for 
medium/large scale networks is prohibitive. Our objective is to 
modify [P.0] in order to turn it into a convex problem, with neg¬ 
ligible performance losses. 

A first simplification comes from observing that the coef¬ 
ficients a,ij and p ZJ are not independent of each other. Their 
dependence is indeed a consequence of the radio propagation 
model. In this work, given the complexity of the topology op¬ 
timization problem, we assume a fairly simple communication 
model. We state that there is a link between nodes i and j , and 
then aij = 1, if the signal-to-noise ratio SNRy at the receiver 
node j , when node i transmits, exceeds a minimum value s m i n , 
i.e., SNRy > s m [ n . If we denote by pr 3 the power received by 
node j when node i transmits, and by cr 2 the noise power, as¬ 
sumed for simplicity to be the same at each receiving node, we 
have = 1, if pRj > s m [ n a 2 := p m in, or otherwise a tJ = 0. 
Assuming flat fading channel modeling, we use the following 
propagation model: 


_ Pij 

PR3 1 + (fij/roV 



where rp is the distance between nodes i and j, and ij is the 
path loss exponent. The parameter 77 plays the role of a scaling 
factor or reference distance, and typically corresponds to the so 
called Fraunhofer distance, such that, if rp >> r 0 , the receiver 
is in the transmit antenna far-field, where the received power 
is inversely proportional to r 7 - - ; conversely, if rp <C To, the 
receiver is in the transmit antenna near-field, where the received 
power is approximately equal to the transmitted one. The unity 
term in the denominator of (8) is used to avoid the unrealistic 
situation in which the received power could be greater than the 
transmitted one. Given the propagation model (8), the relation 







25 


between the power coefficients p^ and the topology coefficients 
Orij is then 


1 if Pij ^ Pmin 
0 otherwise 

In the following sections we will show how to relax this rela¬ 
tion in order to simplify the solution of the optimal topology 
control problem. We will consider two scenarios: a) a deter¬ 
ministic topology, with arbitrary node geometry, where the dis¬ 
tances between the nodes are known, and b) a random topology, 
where the positions of the nodes are unknown and modeled 
as random variables. In case a), since the distances among the 
nodes are known, we optimize the power allocated to each link. 
This strategy is also instrumental in determining the topology, 
which is equivalent to finding the n(n — l )/2 entries a ? ; 7 of 
the adjacency matrix. Conversely, in the random topology case, 
since the distances are not known, we assume broadcast com¬ 
munications, and look for the optimization of the coverage ra¬ 
dius of each node. The deterministic topology case will be the 
subject of Section IV, whereas the random case will be studied 
in Section V. 




IV. Optimal Topology and Power Allocation for 

Arbitrary Networks 


In the case where the distances between the nodes are known, 
the optimization criterion amounts to solving problem [P.0], 
which involves a combinatorial strategy that makes the problem 
numerically very hard to solve, especially for medium/large 
scale networks. The relation (9) reduces the set of unknowns 
to the set pij , but the problem still retains most of its difficul¬ 
ties. To make problem [P.0] to be numerically tractable, we 
introduce a first relaxation so that, instead of requiring a ^ 7 to 
be binary, we assume a^ to be a real variable belonging to 
the interval [0,1]. Under this assumption, problem [P.0] is 
not combinatorial anymore, but it is still a nontrivial nonlinear 
constrained problem. The first important contribution of this 
paper is to propose a relaxation technique that transforms the 
previous problem into a convex problem that can be solved with 
well established and efficient numerical tools. We achieve this 
goal by first introducing the following relationship between the 
coefficients a^ and the distances rjj : 



1 

1 + (■ rijfr Ci3 ) a 


( 10 ) 


where a is a positive coefficient and r Cij is the coverage radius, 
which depends on the transmit power. According to (10), a^ 
is close to one when node j is within the coverage radius of 
node i, i.e., r t j <C r Cij , whereas a^ is close to zero, when 
U 7 r Cij . The switching from zero to one can be made steeper 
by increasing the value of a. 

Given the propagation model ( 8 ), the coverage radius r Cij is 
related to the power p^ and the minimum power required for 
reliable communication p m { n , as follows: 



TO 


Pij 

P min 



1/ri 


(ID 


Plugging (11) in (10), the coefficients aij can be written explic¬ 
itly in terms of the power coefficients as follows: 


a ij — a ij(Pij ) — 


r o (Pij -Pmin) a/r> 


r 0 (Pij ~ PminV^ + r ijPmin 


( 12 ) 


This relation can be also inverted to find the coefficients pij as 
a function of a t j , as follows: 


Pij — Q ( ^'ij ) — Pmin T ^1 ^ 


a ij 


r]/a 


1 CLij 


03) 


r v . 

with ki = PminAf. Expression (12) becomes our relaxed ver- 

r o 

sion of (9) and it allows us to reduce the set of variables to the 
only power vector p. Consequently, problem [P.0] can be re¬ 
laxed into the following problem: 


min 

p 

s.t. 


p T 1 

MTpjj 

e < A 2 (£(?>)) 
Pmin1- — P 


[P-1] 


(14) 


where, thanks to (12), the Laplacian is now written explicitly 
in terms of the power coefficients p t j . In principle, the last in¬ 
equality in (14) makes any link feasible. But this does not imply 
that the final network will be fully connected, because some co¬ 
efficients aij might turn out to be equal zero, implying that the 
link between node i and j is not active. The first important re¬ 
sult, related to the solution of (14), is the following. 

Theorem 1: Given the propagation model in ( 8 ), using the 
relations ( 12 ) between the topology coefficients a t j and the 
power terms pij, problem [P.l] is a convex-concave fractional 
problem if p > a. 

Proof: Let us consider the objective function in [P.l]. 
The numerator of [P.l] is clearly a convex function of p. We 
only have to prove the concavity of the algebraic connectivity 
A 2 (L(p)) with respect to the transmit powers. As a first step, 
we prove that a t j (pij ) is a concave function of p tJ . Then, we 
use this to show that A 2 ( ■) is a concave function of p. 

(i) ( p^) is a concave function of pij : We compute the 

second-order derivative of the function in ( 12 ) with respect to 

Pij 


d aij (Pij ) 
d 2 Pij 


h[(a - rt)ki - k 2 {a + i])(pij -Pmin) a/ri . 

(15) 


where the constants ^2 , and k 4 are given by 

, _ a , _ ahk 2 (pij ~ p m in) a/ll ~ 2 

2 r °’ 3 [ki + k 2 {Pij — Pniin )'*/’ 7 ] 3 if ’ 

fc 4 = . 

Note that ^ 2 , k^ > 0; since p^ > p m [ n Vi, j, we also have 
^3 > 0. From (15), we see that, if a < rj, the second-order 
derivative is always nonpositive and then a t j (p ^) is a concave 
function of p ^. 

(ii) X 2 (•) is concave in p\ Exploiting the properties of the 
Laplacian [16], we can write the quadratic form associated to 
L as in (2). From (12), we also note that each coefficient aij 
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depends only on the corresponding link power pij, and not on 
the other link powers. Given any pair of power vectors p W and 
p( 2 \ let p = (dp W + (1 — f3)p( 2 \ with 0 < /3 < 1. Hence, we 
have 


x r TAa{v))x=- yp] ~ Xj)' 


1 


n n 


1 

2 


1=1 i=l 

n n 



1 

>- 

“2 


i=l 3 = 1 
n n 


a i] (PPif + (! - P)Pif) X ~ X j)‘ 



Xij ( pif 


i=lj=l 



-P)aij(pif) 


= (3x T L [a (+A) x + (1—/3 )x t L [a fp^J) x 


(16) 


where the inequality follows from the concavity of a,ij(pij). 

We further recall that the algebraic connectivity A 2 (L) is the 
solution of the following positive semidefinite program (SDP) 

[5]: 


A 2 (L) = min x T Lx. (17) 

aj_Ll,||aj||=l 

Hence, using (16), we obtain 

A 2 (l (/)p (1) + (1 - /3)/^ (2) )) 

>A 2 (/3L (p™) + {1 - p)L (p™)) 

> (3X2 (L (+))) + (1 - (3) A 2 (l (p (2) )) 

where the last equality follows from a majorization theorem for 
eigenvalues 4 . Thus, the algebraic connectivity is a concave func¬ 
tion of the vector of transmit powers. Hence, (14) is a convex- 
concave fractional problem since it is the ratio of a convex and 
a concave function of p. ■ 

Since (14) is a convex-concave fractional problem, we can use 
one of the methods that solve quasi-convex optimization prob¬ 
lems, see, e.g., [18], [19]. For example, we can use the nonlinear 
parametric formulation proposed in [19]. To do so, we introduce 
the following function: 


with x * E ©, if, and only if 

h* = h(p*,x*) = min {f(x) — p*g(x) : x E 0} = 0 

where h(p*,x*) means that for p = p* the minimum of 
{f(x) — p*g(x) : x E ©} is taken on at x = x*. 

Before applying Theorem 2, it is useful to further convert the 
convex-concave optimization problem [P.l] into the following 
parametric problem: 

min p T 1 — p\ 2 (L,{p)) 

P 

s.t. e < X 2 (L(p)) [P.2] 

lPmin ^ P- ( 18 ) 

By Theorem 1, A 2 (L(p)) is a concave function of p. Hence, the 
objective function in [P.2], as sum of convex functions, is a 
convex function. The constraint sets are convex. Then, problem 
[P.2] is a convex parametric problem, whose solution is a func¬ 
tion of the parameter p that controls the tradeoff between the 
global transmit power and the convergence time. Later on, we 
will show how to find the optimal p. 

Since problem [P.2] is convex, it can be solved using nu¬ 
merically efficient convex programming tools. However, before 
applying any convex tool, it is worth noticing that the feasible 
set in [P.2] is not compact. Hence, even if a solution exists, in 
principle, it could be unreachable in finite time. To overcome 
this potential drawback, we propose next an alternative formu¬ 
lation of [P.2]: Instead of looking for the set of power coeffi¬ 
cients pij , and then for the ap , using (13), we can reformulate 
[P.2] so as to look directly for the variables a t j . Then the opti¬ 
mization problem in (18) can be rewritten in terms of vector a, 
as follows: 


min (j>(a) — p\ 2 {L(a)) 

a 

s.t. e < A 2 (2/(a)) [P-3] 

0 < a < 1 (19) 

where <j>(a) = Y^j-i Y^j =1 We verify next that this 

problem is still convex. To study the behavior of (f>{a), we com¬ 
pute the second-order derivative of q(aij), obtaining 


h(p) = minjp^l — p\ 2 (L(p)) : p E T} 


where p is a real positive parameter and T = {p • p > 
IPmin? ^2(L(p)) ^ e}. In order to find the solution of this 
problem we could use the following result, proved in [19] and 
[ 20 ]. 

Theorem 2: Let f(x) and g(x) be continuous real-valued 
functions \/x E 0, where 0 is a nonempty compact subset of 
[R n and g(x) > 0 Vx E 0. Then 


p 


* 


f{x*) 

g(x*) 


= min 


fiX) 

g{x) 


x E 0 


4 Let A, B be Hermitian matrices, and let C — A + B. Let their eigen¬ 
values be sorted in nondecreasing order. Then the vector of eigenvalues A ( C) 
majorizes the vector A( A) + A(B). Since Ai(L) = 0 for the graph Laplacian, 
it follows that A 2 (A + B) > A 2 (A) + A 2 (B). See [17, Theorem 4.3.27]. 


d 2 q(a zj ) _ rj ( ctij \ v/a 2 1 

— H'l 


da 2 j 


a V 1 — a 


13 


(1 -a i3 Y \a 


--l + 2 n 


u } • 


We infer that 


d 2 q(ajj) 

dafj 


> 0 4 => T] — a + 2 QLCLij > 0. 


We note that, if p > a, then d > 0 for 0 < < 1, 

ij 

so that (j>(a), as a sum of convex functions, is convex 5 . Finally, 
the algebraic connectivity A 2 (L(a)) is a concave function of 
a, as can be proved following the same steps as in Theorem 

5 Note that cp(a) is the sum of functions of the single variables a LJ . Hence the 
convexity of <fr( a) can be studied looking at the convexity of the single functions 
q{a Z j). 
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1. Then, the optimization problem [P.3] is also a convex para¬ 
metric problem, perfectly equivalent to the original problem in 
(18), since the change of variables pij = q(aij) in (13) ensures 
a one-to-one mapping q : U —► U, for 0 < a,ij < 1 , with 
image covering the problem domain in (18) (see [21, p. 130]). 
Hence, assuming 0 < a < 1 — e , with e an infinitesimal posi¬ 
tive constant so that the feasible set of (19) T = {a : 0 < a < 
1 — e , A 2 (L(a)) > e} is a compact convex set in [R n ( n-1 )/ 2 , we 
ensure that the set of minima of (19) is nonempty and by the con¬ 
vexity of the problem we can deduce that all local minima are 
also global. Note that since the optimization problem in (19) is 
convex an optimal solution can be found via efficient numerical 
tools. Furthermore, using Dinkelbach’s algorithm [19], based 
on Theorem 2, we are also able to find the optimal parameter f i 
in [P.3]. More specifically, the Dinkelbach’s algorithm, applied 
to our problem, proceeds through the following steps: 

1. Set i = 1 and let a ? ; be a feasible point of T, with p n = 

H a i) 


(Zr(fti)) 

2. Set [i = (j n and find ai+i £ T that solves the minimization 
problem in [P.3]; 

3. If I h(n,Oi+ 1 ) |=| </>(o; + i)—/LtA 2 (X( a i+i)) l< e", withe" 
an arbitrarily small positive constant, stop and take a^i as 
the optimal link coefficient vector; otherwise, set i = i + 1 , 

= fffx) and 8° t0 ste P 2 ‘ 

Since the topology coefficients < 2 ^ obtained in this way are 
real variables belonging to the interval [ 0 , 1 ], to obtain the net¬ 
work topology, it is necessary to quantize them to convert them 
into binary values, 1 or 0 , indicating the presence or absence 
of a link. This quantization is achieved by comparing each aij 
with a threshold a t h- Of course, the final topology will depend 
on the threshold value. Moreover, the thresholding operation 
will also affect the final result in terms of convergence time 
and energy consumption. It is then of interest to check how 
sensitive the final topology, as well as convergence time and 
energy consumption, are to the choice of a t h- In the ensuing 
section, we present some numerical results to shed light on the 
resulting topologies and their dependence on the propagation 
model parameters. 




Fig. 1. Optimal average energy obtained with exhaustive search and average 
energy obtained by solving the relaxed problem [P.3] versus 77 , for different n 
values. 
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A. Numerical Examples 

Since our optimization procedure is based on a relaxation 
technique, the first important step is to evaluate the impact of 
the relaxation on the final topology and performance. 

Example 1: Comparison Between Exhaustive Search and Re¬ 
laxed Technique: We compare now the topology obtained as a 
solution of the relaxed problem [P.3] with the optimal graph 
obtained by solving directly problem [P.0] using an exhaustive 
search over all possible topologies. For complexity reasons, of 
course we can only perform this comparison for small scale net¬ 
works. We consider networks of n = 4 and 6 nodes. 

To provide results not conditioned to a specific geographic 
node deployment, we averaged the results over 100 statistically 
independent realizations of the nodes locations. In each iteration 
we compute the minimum energy E op reached using the optimal 
exhaustive search over all possible topologies, and the energy 
£ r corresponding to the network topology whose coefficients 
dij are obtained by solving problem [P.3] and thresholding the 


Fig. 2. Ratio between the optimal energy obtained with exhaustive search and 
the energy obtained by solving the proposed relaxed problem [P.3], versus a th , 
for different values of n . 


result with the threshold a t h = 2 x 10 -3 . In Fig. 1, we report the 
average energies E op and E r versus p. We can note from Fig. 1 
that the loss in terms of optimal energy due to the relaxation of 
the original problem is negligible (the energy loss is less than 
1.7 dB). 

Example 2: Impact of Thresholding Operation on Final 
Topology: Clearly, the selection of the threshold a t h plays a 
role in the identification of the final topology. To evaluate the 
impact of a t h on the final topology, in Fig. 2 we report the 
ratio 4^- versus a t h for different numbers of nodes n and for 

Cop 

p = 5. The energies are averaged over 100 independent node 
realizations. We can observe from Fig. 2 that there is a wide 
range of values of a t h suc h that the energy loss is practically 
independent of a t h. This shows that our proposed procedure is 
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Fig. 3. Optimal topologies, for different threshold values and fixed 77(77 = 

rather robust with respect to the choice of a t h* It is now inter¬ 
esting to check the effect of the threshold a t h on the topology, 
considering larger scale networks (in this case, we can only run 
our algorithm, as the exhaustive search is not feasible). As an 
example, in Fig. 3 we show the topologies obtained by solving 
problem [P.3], for a network composed of n = 30 nodes, using 
different values of a t h> for rj = 6 . Comparing the four cases 
reported in Fig. 3, we notice that, only for very low values of 
the threshold [i.e., case (d)], we appreciate a sensitive change 
of topology, whereas for a large range of values of a t h, the final 
topology is practically the same. This means that the proposed 
algorithm, in spite of the relaxation step and the subsequent 
quantization, yields rather stable solutions. 

The previous results pertain to a specific realization of the 
node locations. To provide results of more general validity, in 
Fig. 4, we report the average value of: a) fraction of active links 
7 _ I Ml , . 

nfa_i) ’ b) ^2 (L ); and c) the average energy S r , as a function 
of the threshold a t h- The averages are carried out over 100 inde¬ 
pendent realizations of the nodes location. From Fig. 4, we ob¬ 
serve that there is an interval of values of a t h (roughly, between 
10“ 5 to 10 “ 2 ) for which we obtain a strong reduction in the 
fraction of active links, with respect to the situation where there 
is no threshold, still achieving nearly the same performance, in 
terms of algebraic connectivity and energy consumption. This 
is indeed an important result, as it shows that the relaxed algo¬ 
rithm is weakly sensitive to the choice of a t h- 


6): (a) a th = 0.09. (b) a th — 0.05. (c) a th = 5 X 10 4 . (d) ci th = 10 7 . 






Fig. 4. Average value of (a) fraction of active links, (b) \ 2 (L). (c) energy 
versus threshold value for 77 — 6 . 


Example 3: Impact of Propagation Parameters on Final 
Topology: It is also interesting to look at the change in 
topology as a function of the radio communication model. To 
this end, in Fig. 5, we plot the optimal topologies achieved for 
the same node locations as in Fig. 3, but pertaining to different 
path loss exponents 77 , for a given threshold. Interestingly, we 
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Fig. 5. Optimal topologies, for different path loss exponents q and fixed threshold (a t h = 0.09): (a) q — 3. (b) q — 4. (c) q — 5. (d) // = 6. 


notice, as expected, that, as the propagation loss increases (i.e., 
77 increases), the network tends to become more and more 
sparse. 

The results shown in Fig. 5 refer to a single realization of the 
node positions. To draw conclusions of more general validity, 
we averaged over 100 statistically independent realizations of 
the node locations, for networks of 30 nodes. In Fig. 6 , we report 
the same performance metrics as in Fig. 4, but now as a function 
of the path loss exponent 77 , setting a t h = 0.09. From Fig. 6 , 
we observe that when the attenuation is high (i.e., 77 is large), 
reducing the number of links (making the topology sparser) is 
more important than reducing convergence time. Conversely, 
when the attenuation is low (i.e., 77 is small), increasing network 
connectivity is more important than reducing power consump¬ 
tion. This behavior sounds reasonable and in agreement with 
intuition. 

V. Optimal Topology for Random Geometric Networks 

In this section, we remove the assumption that the node lo¬ 
cations are known a priori and model the network as a random 
geometric graph (RGG). In such a case, the graph connectivity 
properties and the convergence time can only be established in 
a probabilistic sense, asymptotically, as the number of nodes 
tends to infinity. We refer to [22] for the first basic result about 
the convergence of consensus algorithms over random graphs 
and to [23] for a more recent generalization of the convergence 
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Fig. 6. Average value of (a) fraction of active links, (b) A 2 (L). (c) energy 
versus path loss q for a th = 0.09. 

conditions under stochastic disturbances. As shown in [22], 
the rate of convergence of consensus algorithms in the random 
graph case is dictated by the expected value E[e~ 2TsX2 &)] m 
In the following, exploiting the concentration properties of the 
eigenvalues of RGGs [24], [25], we will show how to relate 
the convergence time to the expected value of \ 2 (L). This link 
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will be fundamental to derive the optimal coverage radius, and 
then transmit power, that minimize the energy consumption 
necessary to achieve consensus over RGGs. To achieve this 
goal, it is fundamental to recall and extend some results about 
the spectrum of RGGs. 


A. RGGs 


A random graph is obtained by distributing n points randomly 
over the ^-dimensional space R d and connecting the nodes ac¬ 
cording to a given rule. Let V n = {xi,X 2 , ... ,x n } be a set 
of ^-dimensional vectors Xi , belonging to a compact set ft = 
[—1/2,1/2]^ of U d , whose entries denote the coordinates of the 
nodes in R d , and let || • || be the Euclidean norm on R d . The 
graph topology is captured by the adjacency matrix A which, 
in this case, is a random matrix. An important class of random 
matrices, encompassing the adjacency matrix of our problem, 
is the so-called Euclidean Random Matrix (ERM) class, intro¬ 
duced in [26]. Given a set of n points located at positions x^ i = 
1,..., rc, annxn adjacency matrix A is an ERM if its generic 
(z, j) entry depends only on the difference Xi — Xj , i.e., = 

F(x{ — Xj), where F is a measurable mapping from U d to R. 
An important subclass of ERM is given by the adjacency ma¬ 
trices of the so-called RGGs. In such a case, the entries a,ij of 
the adjacency matrix are either zero or one depending only on 
the distance between nodes i and j, i.e. 



if ||Xi — Xj 
otherwise 


< r 


( 20 ) 


where r is the coverage radius. This is a particular case of an 
ERM, corresponding to having 




if || Xi — Xj 
otherwise 


< r 


( 21 ) 


We will use the symbol G(n,r) to indicate an RGG composed 
of n points, with coverage radius r. 

The RGG model is the most appropriate to capture the 
topology of a wireless network, as it basically states that there 
is a link between two nodes only if they are within the coverage 
radius of each other 6 . 

Next, we recall some of the most important properties of 
RGG’s, in terms of connectivity and spectrum, as they are rele¬ 
vant to our optimization problem. 

1) Connectivity: Some interesting results on the asymptotic 
connectivity of random geometric graphs have been derived in a 
seminal work by Gupta and Kumar [27] who proved that, given 
a set of n points uniformly distributed within a unit square (i.e., 
d = 2), the graph is connected almost surely if the coverage 
radius behaves as 


r 0 (n) 


log(n-) + c n 
7 rn 


with c n —> cxD, as n —> oo. Conversely, if c n —► — oo, the graph 
is disconnected almost surely. This means that the expression 

lo ^ n ^ represents a threshold distance. In the following, we 

6 In practice, a wireless channel is also affected by fading, multipath, and shad¬ 
owing. Hence, the presence of a link between two nodes depends on two sources 
of randomness: distance and fading. In this work, we concentrate on the single 
source of randomness, given by the distance between the nodes, but considering 
both sources of randomness will be an interesting extension of this paper. 



will often use the shorthand notation r + (n) to indicate the law 

y / los( ^ + ^, with c n — > oo, as n — > oo. Hence, a coverage 
radius 7*0 (ft) behaving as r + (n) represents a law that ensures 
connectivity with high probability, as n —► 00. We will use the 
notation 7*0 (ft) ~ r + (n) to indicate such a behavior. 

In [4], [28], it has been shown that the degree of a RGG 
G(n, r) of points uniformly distributed over a two-dimensional 
unit torus 7 is equal to 


d(n) = 7r r 2 n (22) 

with high probability, i.e., with probability 1 — 1 /n 2 , if the radius 
behaves as 7*0(71) ~ r + (n). This implies that an RGG tends to 
behave, asymptotically, as a regular graph, if the coverage radius 
is chosen so as to guarantee connectivity with high probability. 

We are primarily interested in the second eigenvalue of the 
Laplacian, L = D — A, where D is the degree matrix and A is 
the adjacency matrix [see (1)]. From (22), D = irr 2 nl , so that 
we only need to investigate the second largest eigenvalue of A. 
Hence, in the ensuing section, we study the spectrum of A. 

2) Spectrum of a Random Geometric Graph: In [24], [25], 
it is shown that the eigenvalues of the adjacency matrix, or of 
the transition probability matrix 8 , tend to be concentrated, as 
the number of nodes tend to infinity. In particular, in [24] it is 
shown that the eigenvalues of the normalized adjacency matrix 
A n = A/n of an RGG G(ft, r ), composed of points uniformly 
distributed over a unit bidimensional torus, tend to the Fourier 
series coefficients of the function F defined in (21) 

F(z) = / exp (—2irjz T x)dx (23) 


almost surely, for all 2 = [21,22] E Z 2 , where ft r = \x = 
[xi,X 2 ] t E R 2 : \\x\\ < r}. Using polar coordinates, i.e., 
X\ — p sin 0 and X 2 = p cos 0, with 0 <p< r and 0 < 9 < 2tc, 
we obtain 


>277 


F(z) = 


exp (— 2tv jp{z\ sin 9 + 22 cos 9))pdpdO. 


This integral can be computed in closed form. Setting 21 = 
A sin f and 22 = A cos </>, we have 

nr p2ir — cf) 

F(A,(/>)= / / exp(—2 , irjpAco&(£))pdpd£ 

J 0 J—cf) 

with ^ = 9 — cf Furthermore, using the integral expression 
for the Bessel function of the first kind of order k, Jk(x) = 
If ex P U x MO - jhO d O we get 

nr 

F(A,(j>) = F(A) — 2i t J(f2'KpA)pdp. 

Jo 

Finally, using the identity vJo (v)dv = uJ\ (u), we can make 

/\ 

explicit the dependence of F(A) on the index pair [21,22] 

F(z 1 ,z 2 ) = , l 2 J\ Arfl + zf . (24) 

V z i + z 2 \ / 

7 A torus geometry is typically used to get rid of border effects. 

8 The transition probability matrix is the adjacency matrix, normalized with 
respect to the node degree, so that the ith row of the adjacency matrix is divided 
by the degree of node i. 
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Fig. 7. Verification of the inequalities in (26). 


This formula allows us to rank the eigenvalues of A n = Ajn. 
In particular, we are interested in the second largest eigen¬ 
value of A n . Considering that the minimum c overag e radius 

ensuring connectivity behaves as r(n ) ~ / lQ g(^) , • e ^ 

is a vanishing function of n, we can use the Taylor series 

/\ 

expansion of F( zx^zf), for small r. Recalling that, for small 
x , Ji(.'r) = x/2 — x 6 /16 + o(x 5 ), we can approximate the 
eigenvalues as 


+0(r «), (25) 
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Fig. 8. Asymptotic analysis and simulation results for some eigenvalues of A n 
and for A z(L n ) versus the transmission radius. 


shown in (22), the second largest eigenvalue of L n tends asymp¬ 
totically to 


M{Ln) = nr 2 - C2(A„). (29) 

Thus, the algebraic connectivity of the graph can be approxi¬ 
mated, asymptotically, as 


This expansion shows that, at least for small r, the largest eigen¬ 
value equals tt r 2 and occurs at z\ = Z 2 = 0, whereas the second 
largest eigenvalue corresponds to the cases (zi = 1, Z 2 = 0) 
and (zi = 0, Z 2 = 1). More generally, we can check numeri¬ 
cally that, for r < 1/2 and A > 1, the following inequalities 
hold true: 


7rr 2 > rJi(27rr) > ^—\ Ji(27rrA)|. (26) 

A 

The validity of these inequalities can be verified from Fig. 7, 
which shows the three terms in (26) as a function of r and A. 

In summary, denoting the spectral radius of A n as Ci(A*) = 
niaxi<,<„ , where {Ai(n)}” =1 is the set of eigenvalues 

of A , it follows that 


lim Ci(A,) 

n—* 00 


= max 

zGZ 2 



F( 0,0) = 7rr 2 


(27) 


while the second largest eigenvalue of A n) £ 2 (A n ), converges 
to 


lim C2 (A n ) = F( 1,0) = F( 0,1) = rJi(2nr). (28) 

n —>00 

We are now able to derive the asymptotic expression for the 
second largest eigenvalue of the normalized Laplacian L n = 
D n — A n , where D n := D/n is the normalized degree matrix. 
Because of the asymptotic property of the degree of an RGG, 


A 2 (L) = 7rnr 2 — nrJi(2rcr). (30) 

Since the previous expressions have been derived in the 
asymptotic regime, in Fig. 8 we compare the analytic formulas 
for the first and the second largest eigenvalues of A n , as given 
in (27) and (28), with the numerical results obtained by aver¬ 
aging over 100 independent realizations of RGG’s composed 
of n = 1024 nodes. We can notice the good agreement in 
Fig. 8 between the theoretical expression for the algebraic 
connectivity A 2 (L n ), given in (29), and the simulation results. 

B. Minimization of the Energy Consumption: An Analytic 
Approach 

We can now exploit the previous analytic expressions to study 
the energy minimization problem for RGG’s. In the random 
topology case, since the distances are unknown, we cannot op¬ 
timize the power associated with each link. However, we can 
seek the common transmit power that minimizes energy con¬ 
sumption. Thus, in the random setting we assume a broadcast 
communication model, where each node broadcasts the value 
to be shared with its neighbors. In the lack of any information 
about distances among the nodes, we assume that each node uses 
the same transmit power. In this case, the network topology can 
be modeled as a random graph model. It can be shown [22], 
[23] that the system in (3) converges to consensus almost surely, 
i.e., Pr{ lim x(t) = x*l } = 1 assuming that each node has a 

t—* 00 

coverage radius so that the network is asymptotically connected 
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with probability one [14]. Then the rate of convergence of the 
dynamical system in (3) is given [22], [23] by E[e~ 2TsX2 ^]. 
Note that defining /(A2) = e _2 T 3 A 2 (-L) anc [ us i n g the Taylor 
series expansion of /(A2) 9 at the point = E[ A2] = m\ 2 , we 
can write 


/(A 2 ) = f(m\ 2 ) + / (1) (m A2 )(A 2 - m\ 2 ) 

A 2 ) /a _ \2 


+ 


(A 2 - toa 2 ) 




k=3 


k\ 


(31) 


so that g(r) is a convex increasing positive function of r. Let us 
now study the behavior of A2(r). Using 

d[r m J m (0] m 7 / \ p n 1 0 

-—-= r J m _i(r) for m = 0,1,2,... 

we obtain 


A^fr) = —= 27 rrn[l - Jo(2wr)\ 

with \' 2 (r) > 0 for 7*0(n) < r < 0.5. Moreover, since 
dJ m (r) m 

—~j -= - 4 r - 4+1W for m = 0,1,2,... 

dr r 


Consequently, taking the expected value, we get 

e U( x 2 )} = f(m\ 2 ) + - — y^-ct 2 X2 + ... (32) 

denoting with a? 2 = E[(X 2 — mA 2 ) 2 ] the variance of A2. But 
since all central moments of order greater than one of the eigen¬ 
values tend to zero, because of the concentration property, we 
can use the approximation 

E[e~ 2 r»A 2 ] _ e -2T s B[A 2 ]_ ( 33 ) 


As a consequence, the energy spent to achieve consensus can 
now be approximated as 


£ = K 


np 

2E[X 2 Wp))Y 


(34) 


This is the performance metric we wish to minimize in the 
random scenario, with respect to the single unknown p. 

In particular, using the asymptotic expression (30) for the al¬ 
gebraic connectivity, we can introduce the following metric 

F(r \ = + {r/r 0 r\ 

nirr 2 — rrz.Ji( 27 rr) 


We now check, numerically, that the function £(r) given in (35) 
is a convex function of r, for 7*0(71) < r < 0.5, where ro(n) ~ 
r + (n), to ensure connectivity. 

Let us rewrite (35) as 



g(r) 

A 2 (r) 


with g(r) = np m - m [l + (■r/ro) r? ] and A2(r) as in (30). The first- 
and second-order derivatives of g(r) are, respectively 

,r]—l 


/ . dg(r) r 

9 {r) = — 7 — = nPminV —- > 0 


dr 


r 0 


r] 


g n (r) = 


d 2 g(r) 


r 


rj — 2 


= np m i n r)(r] - 1)—— > 0 


dr 2 ' “““ ’'' " T( t r i 

9 For simplicity we drop in A 2 the dependence on L. 


we have 



d 2 X 2 (r) 



27 rn[l — Jo( 27 rr)] + 47 r 2 rnJi( 27 rr). 

(36) 


Observe that the first term on the right-hand side (RHS) of (36) 
is always positive. Furthermore, if r < 1/2, the second term is 
also positive, since Ji(27rr) > Ofor r < 1/2. Hence, A 2 (r) > 0 
and we can conclude that the algebraic connectivity is an in¬ 
creasing and convex function of r for ro(n) < r < 0.5, where 
ro(n) ~ r + (n). 

We can now compute the first- and second-order derivatives 
of the energy function with respect to r. We get 

p' (\ = g'( r ) _ a 2 P0ff(O 
A 2 (r) A l(r) 

and, substituting the corresponding expressions, the extremal 
points can be obtained by solving the following nonlinear 
equation: 

£ (r) = ai (r) + ^(r) Jo(27rr) — a^(r) Ji(27rr) = 0 

with ai(r) = 7r[(?7 — 2)r r? — 2r^], a 2 (r) = 27r(r r? + r q 11 ) and 
a 3 (r) = gr 1-1 . Furthermore, the second derivative of £{r) is 
given by the equation shown at the bottom of the page. In Fig. 9, 
we report £ (r) as a function of r, for different values of g. 
From Fig. 9, we can check that the second derivative is always 
positive in the range of interest. This verifies that £{r) is in¬ 
deed a convex function of r defined on the compact convex set 
ro (n) < r < 0.5. As a consequence, we can state that there 
is always at least a radius r that globally minimizes the energy 
consumption in a RGG. 

Numerical examples. In Fig. 10, we compare the value of £(r) 
obtained by our theoretical approach and by simulation, for var¬ 
ious values of the path loss exponent, 2 < g < 6. The results 
are averaged over 100 independent realizations of random geo¬ 
metric graphs composed of n = 1000 nodes. For each g , the pair 
of radius and energy providing minimum energy consumption is 
indicated by a circle (simulation) or a star (theory). We observe 



(g"(r)\ 2 (r) - 2X' 2 (r)g (r))X 2 (r) + g(r)(2\' 2 2 (r) - A^(r)A 2 (f)) 

a 2 M 
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Fig. 9. Second derivative of the energy versus the transmission radius for sev¬ 
eral values of 77 . 
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Fig. 10. Global energy consumption versus transmission radius for an RGG; 
theoretical values (solid) and simulation (dashed). 


that the theoretical derivations provide a very good prediction 
of the performance achieved by simulation. Furthermore, for 
each 77, there is a coverage radius value that minimizes energy 
consumption. 

The optimal values of the coverage radius as a function of 77, 
as predicted by our theoretical derivations or by simulation, are 
reported in Fig. 11 . From this figure, we observe that there is 
a clear transition from the low power attenuation regime (i.e., 
77 < 3 . 5 ), where the optimal radius tends to make the network 
fully connected, as opposed to the strong attenuation situation 
(77 > 4 . 5 ), where the network is minimally connected. This 
means that in the low attenuation case, minimizing convergence 
time is more important than minimizing power consumption. 


mm iron 



Fig. 11. Optimal transmission radius versus 77 . 
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Fig. 12. Global energy consumption versus per node transmit power for an 
RGG; theoretical values (solid) and simulation (dashed). 

Conversely, in the large attenuation case, it is more important to 
minimize power consumption, by limiting the number of links 
as much as possible, compatibly with the constraint of ensuring 
connectivity. 

Fig. 12 shows the average energy consumption versus the per 
node transmit power 79, under the same settings of Fig. 10 . The 
circles (simulation) and the stars (theory) represent, again, the 
values of p that minimize energy consumption, for each 77. It 
can verified the existence of an optimal transmit power value 
minimizing the energy consumption. 

Finally, Fig. 13 shows the optimal per node power versus 77. 
For low values of 77, the transmit power increases with 77 because 
it must cope with higher attenuations to guarantee connectivity. 
Conversely, for large values of 77, the optimal power decreases 
with 77 because, in such a case, it is more beneficial to limit 
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Fig. 13. Optimal per node transmit power versus zy. 

the number of links, until the minimum power guaranteeing the 
connectivity constraint is reached. 

C. Random Topologies versus Uniformly Spaced Grids 

Finally, it is interesting to compare the energy consumption 
achievable with a random geometric graph and a deterministic 
grid. In the deterministic case, we may also distinguish between 
a graph with nodes scattered arbitrarily in a given area, but with 
positions known, and a regular uniform grid, where the points 
are located over a rectangular grid. 

1) Eigenvalues of a Planar Uniform Grid: We start by de¬ 
riving the algebraic connectivity of a square grid whose n nodes 
are uniformly spaced within a unit square, at a distance 8 = 
1/ni = 1 f y/n. To avoid undesired border effects, we consider 
the wrapping of the unit square in order to form a toroidal sur¬ 
face. Each node is assumed to have a link with the neighboring 
nodes only if they are at a distance less than a coverage radius 
r(p) 9 that depends on the transmit power p, as with geometric 
graphs. In Appendix A, we derive a closed form expression for 
the network degree and for the algebraic connectivity [see, e.g., 
(45)—(48)]. 

A numerical check of our derivations is reported in Fig. 14, 
where we show the numerical value of A 2 (L ), obtained through 
the eigendecomposition of L, and the value given in (48), for dif¬ 
ferent values of 77 . From Fig. 14, we can see a perfect agreement 
between our closed form expression and the numerical results. 
Notice, in particular, the sharp transition behavior: the eigen¬ 
values tend to n or 0 depending on whether the transmit power 
is above or below a threshold; further, the threshold increases 
with p. 

In Fig. 15, we compare the algebraic connectivity of the rect¬ 
angular grid, given by (48), with the theoretical value obtained 
for the random geometric graph, given in (30), assuming the 
same node density, over the same toroidal surface. As shown in 
previous works, see, e.g., [4], RGGs tend to behave asymptoti¬ 
cally as a regular graph. The result shown in Fig. 15 is a further 
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Fig. 14. Network algebraic connectivity versus per node transmit power for 
several values of the path loss coefficient. 


r; = J023 



Fig. 15. Algebraic connectivity of random geometric graph and rectangular 
grid versus transmission radius. 

confirmation of this property. In the same figure, we also re¬ 
port simulation results for the algebraic connectivity, obtained 
by averaging over 100 independent realizations of a RGG. We 
can check, once again, the good agreement between theory and 
simulation. 

Finally, in Fig. 16 we compare the energy consumption ob¬ 
tained assuming full a priori knowledge of the nodes’ locations 
or no knowledge at all. In the first case, the topology and the 
power allocation over each link are optimized according to the 
method illustrated in Section IV. The optimal values, for each 
77 , are indicated by colored dots. In the second case, we report 
(solid line) the energy consumption versus the average power, 
assuming that all nodes transmit with the same power (since they 
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Fig. 16. Average network energy consumption versus average network power. 

do not have any prior information about the other nodes posi¬ 
tions). To make a fair comparison between the two cases, we 
consider in both cases the sum of the powers associated with 
each active link. This means that, in the second case, the energy 
consumption is measured, for every common transmit power p , 

n 

p ^2 

as £ (p) = ^(p ) — > where m (p) denotes the number of neigh¬ 

bors of node i. The results shown in Fig. 16 have been obtained 
by averaging the results obtained in the two settings over the 
same set of 100 independent random deployments of n = 30 
nodes, uniformly distributed over the unit square. 

From Fig. 16 we can see that, as expected, the method as¬ 
suming perfect a priori knowledge of the node positions (in the 
figure denoted as £(p*)) is indeed able to achieve better per¬ 
formance than the other method (<f(p)), as it is able to take ad¬ 
vantage of the a priori knowledge. The optimal average power 
is the result of a tradeoff between transmit power and conver¬ 
gence time, and always depends on the path loss exponent. 

VI. Conclusion 

In this paper we have addressed the problem of finding the 
network topology that minimizes the energy consumption nec¬ 
essary to achieve consensus in a WSN. Assuming a simple flat 
fading propagation model, we have studied two main network 
models: 1) arbitrary networks, where the nodes are arbitrarily lo¬ 
cated, but with known positions, and 2) a random model, where 
the positions are not known a priori, but are modeled as random 
variables. In the first case, we have shown how to optimize the 
network topology and the power allocated to each active link 
in order to minimize the total energy necessary to achieve con¬ 
sensus within a prescribed accuracy. Topology optimization is, 
in general, a combinatorial problem and hence computationally 
demanding. To simplify the solution, we introduced a relaxation 
step that enabled us to reformulate the energy minimization 


problem as a convex-concave fractional program. This alterna¬ 
tive formulation can be cast as an equivalent parametric convex 
problem, which enables efficient solutions. The link weights re¬ 
sulting from the solution of the convex problem must then be 
thresholded to find out the network topology. We have shown 
through numerical results that the thresholding operation may 
yield a considerable reduction of the number of active links, yet 
with very limited effect on performance. 

In our deterministic setting, we assumed point-to-point links 
and we optimized the power over each link. In practice, a con¬ 
sensus algorithm running over a wireless network could ben¬ 
efit from the broadcast channel. Deciding between broadcast or 
one-to-one links entails a proper choice of the medium access 
strategy, to establish in which time slot each node has to listen 
to which broadcaster. 

Conversely, in the random network case, lacking any infor¬ 
mation about the internode distances, we assumed a broadcast 
communication strategy. In such a case, the network topology 
is modeled as a random geometric graph. We have derived 
closed form expressions, albeit valid only asymptotically, for 
the algebraic connectivity, assuming a common transmit power. 
Then, building on these expressions, we have shown that the 
energy consumption is a convex function of the coverage radius. 
We have also shown that a random geometric graph performs, 
asymptotically, as a regular graph built over a rectangular grid. 
This confirms previous results, although now in the context of 
energy minimization over consensus networks. 

Finally, we have compared the performance achievable with 
arbitrary and random graphs. Clearly, the knowledge of the 
node locations allows better power allocation, that translates 
into lower energy consumption to achieve consensus. However, 
in practice there is a price associated with the knowledge of 
node location. This knowledge requires the acquisition of 
the node positions first and then a centralized optimization. 
Conversely, the random approach can be followed also in a 
decentralized fashion, with only minimal information about 
some global parameters like number of nodes and area covered 
by the network, and it does not need any extra hardware or 
computation to acquire the nodes’ locations. 

In this paper, we have assumed a simple flat fading channel 
model, whose effect is only to introduce attenuation and su¬ 
perimpose noise. The simple model captures the essence of the 
problem and keeps the overall problem complexity under con¬ 
trol. However, looking at potential applications, it would be 
interesting to generalize the approach to the case where the 
channel model is more complicated. Furthermore, the network 
topology has been assumed to be static. However, a dynamic 
topology, possibly adapted to the consensus state, might pro¬ 
vide better performance. 


Appendix A 

In this section, we derive an analytical expression for the 
eigenvalues of the adjacency matrix A of a rectangular grid over 
a unit torus, as a function of the coverage radius r(p). We as¬ 
sume, for simplicity that the number of nodes n is a square 
number, i.e., n = n\, with m integer. We number the rows of 


















36 


the grid, 1 to rai, going from bottom to top. The n x n matrix A 
can be expressed in a cyclic block form, as 



Aq 

Ai 

A2 

j 

• • • Jrlj n 1 

-2 

A ni _ 1 


— 1 

Ao 

Ai 

J 

• • • - rL ni 

-3 

An 1 —2 

A = 

Aii —2 

A ni _i 

Ao 

J 

• • • 1 

-4 

A ni —3 


- A! 

A2 

A3 

J 

... ^x ni 

-1 

Ao 


where each block Ai, for i = 0 ,..., n\ — 1 , is an rii x n\ 
circulant matrix. In particular, the block Ao is the adjacency 
matrix between the nodes lying on the same row of the grid, 
starting from the leftmost node and proceeding along the row. 
Ai represents the adjacency matrix between the nodes of row j 
and the nodes lying on the (z + j)th row, for i = 1 ,..., m — j, 
and on the (z + j — 7 ii)th row for i = n\ — j + 1 ,..., ni — 1 . 
The toroidal structure, used to avoid border effects, reflects into 
the cyclic structure of A, so that we have A ni _ v ; = Ai if 


i = (m — l)/2 T 1,..., rii — 1 and n\ odd 
z = ni/2 + 1 ,..., ni — 1 and n i even. 


We also note that A is a block circulant matrix with circulant 
blocks (BCCB). Since each block Ai for z = 0,..., n\ — 1 is 
anni x n \-dimensional circulant matrix, the eigenvalues of Ai 
admit the following decomposition 

/V t—f /V' 

A. — I? a F 

**-1 — 1 m n i 


with 



1 



1 

LU 




l w"- 1 - 1 


U 


n i —2 


• • • 


LJ 


n i — 1 


• • • 


UJ 


where u = exp (—j27r/ni); is a diagonal matrix whose en¬ 
tries are the eigenvalues of Ai, i.e., the discrete Fourier trans¬ 
form (DFT) of the first row of A ? ;. Hence, the matrix A admits 
the following diagonalization: 


for /, m = 0 ,..., ni — 1 and the coefficient (p) is kth entry 
of the first row of Ai for fc, i = 0,..., n\ — 1. Equivalently, (37) 
can be seen as the two-dimensional DFT of a matrix A whose 

/“V /V 

rows A(z,:) are given by A(z,:) = A^(l,:) for z = 0 ,..., m — 
1, applying the 1-DFT to the rows and then to the columns of A. 

In general, for a regular grid where each node has degree d(p), 
the algebraic connectivity can be expressed as 

A 2 (L(j>)) = d(r(p)) 

n i — l n\ — 1 

- 53 53 exp(-j2irk/m) (38) 

1=0 k =0 

where the coefficients a^ } ( p ) can be expressed as follows. Let 
u(x) denote the step function, i.e., u(x) = 1 for x > 0 and zero 
otherwise. For m odd, we have 

4 0) (p) = 0 and 

a^ l \p) = u(r(p) - ib) i = 1,..., (m - l)/2, (39) 

a^\p) = u(r{p) — kb) and 

a k\p) — u(r(p) - Vi 2 + k 2 b) i, fc = 1 ,..., (ni — l )/2 

(40) 

a k\p) = a nl-k(p) k - ( n i _ l )/2 + 1 , • * • ,«i - 1 and 
i = 0,...,(m - l)/ 2 . (41) 

Finally A; = A ni _ v ; for z = (ni — l)/2 + 1,..., n\ — 1. 
Similarly, in the case of n\ even, we have 

a o°\p) = 0 and 


U) 

% 

(p) 

= u{r{jp) — i6) 

z = 1,..., rii/2, 


(42) 

a (0) 

a k 

(p) 

= u(r(p) — kb) 

and 



(i) 

(p) 

= u(r(p) — 

! + k 2 b) z, fc = 1,... 

, n i/2, 

(43) 

(i) 

a fc 

(p) 

= a nl-k(P) k 

— 1^/2 ~F 1 7 7 77- 2 . — 

1 and 



z 

= 0,..., rzi/2 



(44) 

with 

A^ 

— An t — 2 if ^ — 

ni/2 + 1,..., m — 1. 




Finally, we can derive analytical expressions of the network 
degree and of the algebraic connectivity for m odd as 


A = (F ni <g> F ni ) h A(F ni O F ni ) 
where ( 8 ) denotes the Kronecker product and 

ri\ —1 

A= E 

k =0 


with ft ni = diag[l, a;,..., cn ni_1 ]. The entries A/ ?m (A) of the 
matrix A can be also expressed, after a few algebraic manipu¬ 
lations, as 


n i — l ni — 1 

Al,m(A) = 53 53 43 (p) 

2=0 fc = 0 

x exp (— j2'Kklfni) exp (—j2irmi/ni) (37) 



and 

A 2 (i<p)) 


(m-i)/2 

2a fc° } W 

fc=l 

(m-l)/2 / (m-l)/2 

+ 2 13 a o l) (p)+ 2 X 3 a fc } 

i=i y fc=i 

(45) 

(m—1)/2 

^ 2a8(p)(l — cos(27rfc/ni)) 
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and for m even 
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2 = 1 

+ 2 4 0 1 /2b) + 2 4?/ / 2 2) b) 

ni/2— 1 ni/2—1 

- 4 E £ 4* } (p) cos(2irk/rii). (48) 

2=1 fe=l 

For example, in the case where r(p) < \/2<5, i.e., assuming a 
regular grid of degree d(p) = 4, we obtain 


A ni _i = Ai = I and Ai = O Vi = 2,..., n\ — 2 (49) 

while Aq is a circulant matrix whose first row is the n\ -di¬ 
mensional vector [0100... 001]. Then, the eigenvalues 
are given by A/(Aq) = 2 cos(27i7/ni) and applying the 

formula in (37) the eigenvalues of A are A/ ?m (A) = 

2 cos(27i7/ni) + 2 cos(27rra/ni) for /, m = 0,...,ni — 1. 
The second largest eigenvalue of A is obtained for l = 1 and 
m = 0 and is given by Ai,o(A) = 2cos(27r/ni) + 2. Thus 
the eigenvalues of the Laplacian matrix can be expressed as 
A i(L) = 4 — A ? ;(A) and the algebraic connectivity of the 
network is given by A 2 (L) = 2 — 2 cos(27r/ni). 
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Abstract. The availability of both global and regional el¬ 
evation datasets acquired by modern remote sensing tech¬ 
nologies provides an opportunity to significantly improve the 
accuracy of stream mapping, especially in remote, hard to 
reach regions. Stream extraction from digital elevation mod¬ 
els (DEMs) is based on computation of flow accumulation, 
a summary parameter that poses performance and accuracy 
challenges when applied to large, noisy DEMs generated 
by remote sensing technologies. Robust handling of DEM 
depressions is essential for reliable extraction of connected 
drainage networks from this type of data. The least-cost flow 
routing method implemented in GRASS GIS as the mod¬ 
ule r.watershed was redesigned to significantly improve its 
speed, functionality, and memory requirements and make it 
an efficient tool for stream mapping and watershed analysis 
from large DEMs. To evaluate its handling of large depres¬ 
sions, typical for remote sensing derived DEMs, three differ¬ 
ent methods were compared: traditional sink filling, impact 
reduction approach, and least-cost path search. The compari¬ 
son was performed using the Shuttle Radar Topographic Mis¬ 
sion (SRTM) and Interferometric Synthetic Aperture Radar 
for Elevation (IFSARE) datasets covering central Panama at 
90 m and 10 m resolutions, respectively. The accuracy as¬ 
sessment was based on ground control points acquired by 
GPS and reference points digitized from Landsat imagery 
along segments of selected Panamanian rivers. The results 
demonstrate that the new implementation of the least-cost 
path method is significantly faster than the original version, 
can cope with massive datasets, and provides the most ac¬ 
curate results in terms of stream locations validated against 
reference points. 


Correspondence to: M. Metz 
(markus.metz@uni-ulm.de) 


1 Introduction 

Shuttle Radar Topographic Mission (SRTM; Farr et al., 
2007) and various airborne Interferometric Synthetic Aper¬ 
ture Radar for Elevation (IFSARE) surveys provide a new 
generation of elevation data in regions that have had only 
limited, often low resolution coverage. These topographic 
data sets are increasingly used to improve mapping of geo- 
morphic and hydrologic features, especially in remote, hard 
to reach areas and at regional to global scales (e.g., Kinner 
et al., 2005; Lehner and Doll, 2004; World Wildlife Fund, 
2009). 

In spite of significant advances in the development of flow 
routing algorithms (e.g. Quinn et al., 1991; Costa-Cabral and 
Burges, 1994; Holmgren, 1994; Quinn etal., 1995; Tarboton, 
1997), accurate extraction of drainage networks from Digi¬ 
tal Elevation Models (DEMs) generated by remote sensing 
technologies such as SRTM or IFSARE remains challeng¬ 
ing. To fully understand the problem, it is important to real¬ 
ize that the mapped elevation surfaces include the top of the 
forest canopy, as well as anthropogenic features, rather than 
the ground surface required for flow routing. Although ho¬ 
mogeneous forest canopy generally follows the shape of the 
ground surface, gaps in vegetation that can stretch over hun¬ 
dreds of meters create large, often nested, depressions that 
pose difficulties for flow routing. In addition to these large 
depressions, the surface over forested areas is noisy, creating 
numerous small depressions and barriers that further com¬ 
plicate drainage network extraction. Therefore, one of the 
important questions investigated in this paper was whether 
such data are suitable for drainage network extraction at all 
and if yes, how accurate are the extracted drainage networks 
and which methods provide the most reliable results. 

At the same time, the broad availability of elevation data 
dramatically increased the extent of regions that can be ana¬ 
lyzed at relatively high resolutions, given the computational 
capabilities that can support processing of massive DEMs. 
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Therefore, significant effort has been devoted to the devel¬ 
opment of new flow tracing and watershed analysis algo¬ 
rithms that support efficient processing of large DEMs and 
address the issue of routing through complex nested depres¬ 
sions (e.g. Rivix Limited Liability Company, 2001; Arge et 
ah, 2003; Danner et ah, 2007). 

The most widespread method for handling depressions is 
sink filling, up to the level of the sink spill point, com¬ 
bined with routing through the resulting flat area (Jenson and 
Domingue, 1988; Lairfield and Leymarie, 1991; Planchon 
and Darboux, 2001; Wang and Liu, 2006). Improved sink 
filling methods (e.g. Garbrecht and Martz, 1997; Grimaldi 
et al., 2007; Santini et al., 2009) first fill sinks, and then in¬ 
troduce a gradient to all flat areas to provide non-zero gra¬ 
dients for flow routing. The method complementary to sink 
filling is carving or breaching (Rieger, 1998; Martz and Gar¬ 
brecht, 1998) where a channel is carved out of each sink, 
breaking through the (artificial) obstacle. Both principles can 
be combined in an impact reduction approach (IRA; Lindsay 
and Creed, 2005) that for each sink determines the method 
that causes the least impact on the source dataset. All these 
approaches alter the elevation data in order to ensure full 
drainage, assuming that sinks are artifacts created either by 
too low (artificial pits) or too high (artificial drainage blocks) 
elevation values. 

An alternative to the modification of elevation data is to 
determine the least-cost drainage paths (LCP) through un¬ 
altered terrain and out of sinks. LCP search methods were 
generally designed to find the shortest or fastest route from a 
starting point to a given destination, used for example in car 
navigation devices or to generate cumulative cost surfaces. A 
particular LCP search method (A* Search; Hart et al., 1968) 
was adapted for flow routing and watershed analysis and im¬ 
plemented as the module r.watershed (Ehlschlaeger, 1989) in 
GRASS GIS (Neteler and Mitasova, 2008, see GRASS De¬ 
velopment Team 2010 for binaries and source code). When 
applied to ILSARE derived DEMs, this approach has pro¬ 
vided more accurate flow routing through large, nested de¬ 
pressions with fewer artifacts than traditional sink filling 
(Kinner et al., 2005). However, the module was not de¬ 
signed to handle the massive DEMs that are currently avail¬ 
able (Arge et al., 2003). To render the module suitable for 
analysis of very large datasets at ever higher resolutions, and 
to reduce the artifacts in flow patterns due to the single flow 
direction (SLD) algorithm (Quinn et al., 1991; Holmgren, 
1994; Tarboton, 1997), redesign of the implementation and 
addition of more flow routing options was necessary. 

In this study, we present substantial improvements to the 
LCP method implementation and evaluation of its efficiency 
and accuracy when applied to radar-based DEMs. We com¬ 
pare the performance of the improved module with (1) its 
original implementation, (2) a module based on a disk in¬ 
put/output (I/O) efficient method (Arge et al., 2003), and 
(3) the IRA implementation in the Terrain Analysis System 
(TAS, Lindsay and Creed, 2005). We also analyze the impact 


of mapping technology (ILSARE, SRTM), resolution, and 
DEM resampling on the accuracy of the extracted drainage 
networks. Although the LCP method has been used since 
early 90ies, the literature that describes the algorithm, its 
implementation and properties is very limited (Ehlschlaeger, 
1989). This paper aims to fill this gap in literature and to 
highlight the value of this method and its improvements for 
analysis of modem DEMs. 

2 Methods 

2.1 Improved least cost path search algorithm 

The LCP algorithm (A* Search, Hart et al., 1968; 
Ehlschlaeger, 1989) used for flow routing, flow accumula¬ 
tion, and watershed analysis of raster-based DEMs was re¬ 
designed to increase the processing speed and decrease mem¬ 
ory consumption. The implementation in the GRASS mod¬ 
ule r.watershed (Fig. 1) starts with potential outlet points. 
Natural ultimate outlets are e.g. river mouths opening into 
oceans or lakes without outflow. On gridded elevation mod¬ 
els, potential outlets are grid cells along the map boundaries 
or cells with at least one neighbor with unknown elevation, 
e.g. masked ocean. All potential outlet grid points are in¬ 
serted into a list sorted by costs. Costs are measured as eleva¬ 
tion and order of addition to the sorted list (grid cells added 
earlier have higher precedence in case of equal elevation). 
For the actual search, the grid cell with the lowest eleva¬ 
tion (smallest cost) is extracted and removed from the sorted 
list, marked as processed and its neighbors are investigated. 
This causes the search to proceed along the least steep up¬ 
hill slope. At each step during the search, only neighboring 
grid cells that are not yet in the search list and not yet pro¬ 
cessed are added to the list, and drainage direction for these 
neighboring grid cells is set towards the current grid cell. If 
a depression is encountered, the search follows the steepest 
downhill slope to the bottom of a depression and then pro¬ 
ceeds again along the least steep uphill slope. The search 
proceeds in this manner upstream along the least steep slope 
and terminates when all grid points have been processed. The 
LCP search provides two results: (a) flow direction for each 
cell in a standard D8 manner (O’Callaghan and Mark, 1984), 
and (b) the order in which cells must be processed for flow 
accumulation. 

The search component of the original implementation was 
modified by augmenting it with a minimum heap used as pri¬ 
ority queue (Atkinson et al., 1986; Metz and Ehlschlaeger, 
2010), which led to a substantial speed improvement. In 
the original implementation, the time needed to keep the 
sorted list sorted increased exponentially with the number 
of grid cells, while in the new implementation this time in¬ 
crease is logarithmic. Removal of redundant information in 
intermediate data as well as reduced memory requirements 
for both the search and the flow accumulation components 
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Fig. 1. Flowchart illustrating the least cost path search algorithm. 



Fig. 2. SRTM and IFSARE coverage of Panama. GPS ground control points are displayed in black. 


have further enhanced the module’s capacity to process larger 
DEMs. For massive datasets that can not be processed with 
the amount of memory available, the module can optionally 
use external memory with intermediate data stored on disk. 
These changes, aimed at improving the computational per¬ 
formance, did not affect the quality of the results. 

To improve the flow routing accuracy and to reduce arti¬ 
facts in the flow accumulation pattern, a multiple flow di¬ 
rection (MFD) algorithm, based on Holmgren (1994), was 
implemented using the order determined by the LCP search 
with an option to control the strength of flow convergence. 
For grid-based elevation models, an MFD approach con¬ 
forms better to theoretical surface flow dispersion than single 
flow direction in which only one neighboring cell can receive 
surface flow from the current cell (Freeman, 1991; Quinn et 
ah, 1991; Holmgren, 1994; Tarboton, 1997). Initial D8 flow 
directions as determined by the LCP search are used to route 
flow out of depressions when all unprocessed neighboring 
grid cells have higher elevation than the current grid cell. 


2.2 Elevation data 

Two different sources of elevation data were used to demon¬ 
strate the improvements in the LCP method (Fig. 2). Coun¬ 
trywide elevation coverage of Panama was available as a 
3 arc second resolution SRTM DEM (Farr et ah, 2007). 
SRTM version 2.1 provided by the NASA Jet Propul¬ 
sion Laboratory (http://www2.jpl.nasa.gov/srtm/, download 
at http://dds.cr.usgs.gov/srtm/) was selected for this study as 
the most reliable version in terms of accuracy and minimal 
artifacts, after evaluating properties of the SRTM products 
available at the time of writing (vl, v2.1, v3, v4.1). The 
published absolute vertical error of the version 2.1 SRTM 
DEM is 6.2 m for South America including Panama (Farr 
et ah, 2007), although several studies report higher accura¬ 
cies (see e.g. Rodriguez et ah, 2006). A recent airborne IF¬ 
SARE survey has provided new, more detailed information 
about the topography in central Panama at 10 m resolution, 
using technology with published vertical accuracy of ^3 m 
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Fig. 3. Shaded relief of a Landsat false color composite (R,G,B = 
4,5,3) overlaid with GPS reference points in red and points digitized 
from Landsat in grey. 

(Andersen et al., 2006). IFSARE (InterFerometric Synthetic 
Aperture Radar - Elevation) is derived in a way very simi¬ 
lar to SRTM, with the main difference that IFSARE is air¬ 
borne and SRTM data was recorded with sensors onboard 
the Space Shuttle Endeavour. Both IFSARE and SRTM used 
two antennas to collect radar data; elevation information was 
derived by analyzing the data recorded by the two antennas 
(the general concept of the new satellite-borne TanDEM-X 
mission is similar, collecting radar data with two antennas). 
Neither of the surveys penetrated the forest canopy to the 
ground surface and the elevation surface was over large re¬ 
gions defined by a triple canopy tropical forest environment 
with tree heights of more than 30 m above ground. Given 
the widespread use of the SRTM data, one of the important 
questions explored in this paper was whether such data are 
suitable for stream extraction at all, and if yes, what is the ac¬ 
curacy and which methods provide the most reliable results 
in this challenging environment. 

SRTM v2 tiles covering all of Panama were combined and 
gaps in the dataset filled using the regularized spline with ten¬ 
sion (RST) interpolation method (GRASS module rfillnulls ; 
Mitasova and Mitas, 1993). The seamless SRTM coverage 
was then reprojected with bicubic interpolation from geo¬ 
graphic to the UTM zone 17N coordinate system with 90 m 
resolution to keep reprojection modifications to a minimum. 
To evaluate the impact of resampling on stream extraction 
accuracy (see e.g. Valeriano et al., 2006) and to generate 
data for additional performance testing, the reprojected 90 m 
SRTM DEM was reinterpolated to 30 m resolution using the 
RST method. 


The IFSARE data were provided as a 10 m resolution 
DEM (Kinner et al., 2005) in the UTM zone 17N coordi¬ 
nate system and did not require additional processing. For 
additional testing purposes, the IFSARE DEM was down- 
sampled to 30 m resolution using bicubic interpolation in or¬ 
der to have DEMs from two sources (SRTM and IFSARE) 
with different level of detail, but identical resolution of 30 m. 

2.3 Stream location data 

Two sets of data points were used to evaluate horizontal accu¬ 
racy of the drainage network extraction methods: (i) stream 
segments digitized from Landsat imagery (TM 5, year 2000, 
scene idLT50120542000087XXX02, provided by the United 
States Geological Survey - USGS) and (ii) GPS field mea¬ 
surements. Bands 3, 4, and 5 of the Landsat TM scene were 
used at its native resolution of 30 m. The geographic projec¬ 
tion for this study (UTM 17 North) was identical to the pro¬ 
jection of this Landsat scene, i.e. the Landsat grid geometry 
and values were not modified. A Landsat false color com¬ 
posite with Red, Green, and Blue assigned to the channels 
4, 5, and 3 respectively, clearly separated vegetation from 
waterbodies and provided the background for manual digiti¬ 
zation of points along the river centerlines. Only streams and 
rivers with a width less or equal to 4 grid cells (120 m) were 
digitized in order to exclude lakes, anabranching or braided 
rivers, and rivers too broad to reliably determine a stream 
center using Landsat imagery (Fig. 3). The horizontal ac¬ 
curacy of these manually digitized reference points is given 
by the imagery resolution of 30 m or better, depending on 
the stream width and the location of the stream centerline in 
relation to the grid cells. 

GPS points were collected in the field using a Corval¬ 
lis Microtechnology March v3.7 GPS unit with ~2m posi¬ 
tional accuracy. Sites in the Chagres river watershed were 
measured during the years 2002 to 2007 and sites at lower 
reaches of most major rivers across Panama were measured 
during 2005 and 2009 years. GPS points were collected 
along clearly identifiable perennial rivers at locations where 
a reliable GPS signal was available. The GPS measurements 
included a larger proportion of points acquired in mountain¬ 
ous regions along smaller rivers, while most of the points 
located along larger rivers in low-gradient flood plain and 
coastal plain landscapes were digitized from Landsat im¬ 
agery (Fig. 3). 

The SRTM water body database was not used for the ac¬ 
curacy evaluation because this cannot be regarded as an in¬ 
dependent data source since it was used in the creation of the 
SRTM DEM v2, and because even the larger rivers visible on 
Landsat imagery were still too narrow to be captured in the 
SRTM water body database. 

We have also considered and rejected the use of hydrog¬ 
raphy (blue lines) from topographic maps because of their 
low reliability and a relatively coarse scale of 1:50 000. Sev¬ 
eral studies that have compared the blue lines with on-ground 
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Fig. 4. Extraction of a single main channel: (A) Location of areas shown in images (B), (C), and (D); (B) The starting points of the extracted 
streams (red lines) were well upstream of the reference points (black crosses). Flow accumulation derived from IFSARE 10 m is used as 
background. The extracted streams (red lines) are located within broader stream tubes in low-gradient areas close to the coast, for both (C) 
LCP-derived streams and (D) streams derived with sink filling. IFSARE 10 m was used for both (C) and (D) representing the same area. 


surveys of existing streams demonstrated that the blue lines 
were the least accurate source of stream location informa¬ 
tion (see e.g. Colson, 2006 and the references therein), of¬ 
ten based on old data digitized from aerial photographs with 
large errors in locations where the streams were not visible, 
due to the presence of vegetation. In fact, one of the major 
motivations for deriving the drainage network from the IF¬ 
SARE data in Panama was the inadequate scale of available 
blue lines with many larger streams missing and with errors 
in the mapped streams location. Until recently, the relative 
inaccessibility of some regions (e.g. the upper Chagres river) 
and an incessant cloud cover have discouraged mapping in 
these areas (Kinner et al., 2005). 

2.4 Flow accumulation and evaluation of extracted 
drainage networks 

Several tests were performed to evaluate the tested flow rout¬ 
ing algorithms. The first test evaluated the processing speed 
and capability to handle large DEMs when computing flow 
accumulation maps. In this test, the performance of the new 
LCP implementation was compared with the old r.watershed 
module (GRASS Development Team, 2008) and with a rel¬ 
atively new, I/O-efficient algorithm, used in the module 


r.terraflow (Arge et al., 2003). All computations were done 
on the same hardware (AMD Athlon X2 3 GHz and 8 GB 
RAM) but different operating systems (Linux 64 bit for the 
GRASS modules r.watershed and r.terraflow , and Windows 
XP 32 bit for TAS GIS which is a 32 bit application for Mi¬ 
crosoft Windows and was not available as 64 bit application 
at the time of writing). 

The second test evaluated the horizontal accuracy of the 
extracted drainage paths for algorithms with different han¬ 
dling of depressions. Three approaches were tested: (i) sink 
filling as implemented in r.terraflow, (ii) the impact reduction 
approach (IRA) as implemented in TAS GIS (Lindsay and 
Creed, 2005), and (iii) the LCP algorithm as implemented in 
the enhanced version of r.watershed in GRASS GIS 6.4. To 
test the sink filling method, r. terraflow was used because of 
its capability to efficiently process massive DEMs. 

The third test evaluated the impact of DEM resolution 
and level of detail on the horizontal accuracy of the ex¬ 
tracted drainage paths for all tested algorithms {r.watershed, 
r.terraflow , and TAS). 

The tests were performed for the central Panama region 
using both the IFSARE and SRTM data at the 10 m, 30 m 
and 90 m resolutions by computing flow accumulation with 
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MFD dispersal and extracting drainage networks from the 
computed flow accumulation. 

Drainage networks were extracted from all MFD flow ac¬ 
cumulation maps using a minimum upstream catchment area 
of 100 000 m 2 as a threshold. This threshold was selected 
for testing purposes and to ensure that all measured or dig¬ 
itized points were located downstream from the extraction 
starting point and that all points had a drainage path associ¬ 
ated with it (Fig. 4b). Accurate identification of the channel 
head zones was beyond the scope of this paper because it re¬ 
quires more than elevation data as it is often significantly in¬ 
fluenced by local geology, groundwater level, and land cover 
(North Carolina Division of Water Quality NCWQ, 2010). 
According to the NCWQ (2010) study, “the stream origins 
usually occur as transition zones in which the location and 
length of the zone is subject to fluctuations in groundwater 
levels and precipitation”. The tested 90 m and 30 m resolu¬ 
tion SRTM DEMs measured over at least 30 m high canopy 
of dense tropical forest did not provide sufficient informa¬ 
tion for accurate identification and mapping of the dynamic 
channel heads zones. Therefore our focus was on mapping 
streams and rivers that have sufficiently large contributing 
areas that the existence of a well defined stream can be ex¬ 
pected. 

In order to extract a single main channel in broader flood- 
plains where MFD flow accumulation generates several cells 
wide stream tubes that may include cells above the given 
threshold, a new stream starting point was defined only if 
it was not located within such a stream tube (Fig. 4). Down¬ 
stream channel tracing followed the steepest slope and max¬ 
imum flow accumulation within the stream tube. To egress 
from depressions using the LCP method, streams again fol¬ 
lowed the maximum flow accumulation which was in these 
cases identical to the drainage direction initially determined 
during the LCP search. As a result, the drainage paths ex¬ 
tracted from MFD flow accumulation were a single grid cell 
wide and were easily vectorized for further analysis. This 
approach of extracting single cell wide dendritic and topo¬ 
logically correct drainage networks from flow accumulation 
obtained with any kind of flow distribution method avoids 
the problems associated with skeletonizing rasterized MFD 
channels. The MFD capabilities to map flow dispersal in 
floodplains can then be preserved and single flow direction 
(SFD) routing is not necessary for flow accumulation and 
subsequent channel extraction (although it remains an op¬ 
tion). 

Assessment of drainage network extraction accuracy 
was then performed by calculating distances between the 
drainage paths derived from the IF SARE and SRTM DEMs 
and the points measured along the streams, consisting of 338 
on-ground GPS measurements and 995 points digitized along 
rivers from the Landsat imagery (Figs. 3,4). 

In order to determine which method or level of detail pro¬ 
vided more accurate results, we determined the distance of 
reference points to the nearest extracted drainage path. The 


difference in the distance to reference points between meth¬ 
ods or level of detail was then used to assess which method 
or level of detail provided more accurate results. Distance 
differences were tested with two-tailed Student’s t-tests for 
statistical significance and a = 0.05. If distance differences 
were significantly different from zero, i.e. one of the tested 
methods/levels of detail was more accurate than the other, the 
sign of the difference indicated which method/level of detail 
was more accurate. 


3 Results 

3.1 Performance comparison 

To demonstrate the improvement in computational perfor¬ 
mance of the new LCP method implementation, the process¬ 
ing times needed by the old and new version of r.watershed , 
and rterraflow were compared, with the results summarized 
in Table 1. For flow accumulation computation, the new 
version was 350 times faster than the old version for the 
central Panama area represented by a relatively small DEM 
with 2 million grid cells at 90 m resolution. The improve¬ 
ment was even more dramatic for the countrywide coverage 
at 90 m resolution with 27 million grid cells: the new version 
was about 1750 times faster. Although absolute processing 
times are dependent on the specific hardware used for test¬ 
ing, we expect that relative time differences will be simi¬ 
lar on other systems because the software optimizations are 
hardware-independent. Processing the larger regions used 
in this study with the old version was impractical because 
it could easily take days (Arge et al., 2003), whereas the new 
version required only minutes to execute. The computational 
time needed by the I/O-efficient r.terraflow was significantly 
lower than that for the old r.watershed , but the rterraflow 
module was not as fast as the new LCP implementation on 
our test system (Table 1). We have also considered the case 
when the data do not fit into memory and r.watershed uses 
segmented processing, with intermediate data being stored 
on disk. This leads to longer processing times than for the 
all-in-memory mode. Processing times observed on our test¬ 
ing system in the segmented mode were still shorter than for 
r.terraflow , which uses I/O-efficient algorithms specifically 
designed for large datasets (Arge et al., 2003). The size of 
intermediate data created by r.watershed in the segmented 
mode was less than 20% of those created by r.terraflow. This 
can be of advantage if intermediate data are just a bit too large 
to fit into memory. It is important to note that, theoretically, 
r.terraflow with its advanced I/O-eflicient algorithms should 
be faster than r.watershed for DEMs much larger than the 
ones processed here and on systems with less physical mem¬ 
ory. 

TAS GIS was not included in the performance compar¬ 
isons, because its memory requirements for processing the 
10 m IFSARE DEM and the 30 m SRTM DEM for all of 
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Table 1. Processing time for the different DEMs on a Linux 64 bit system with an AMD Athlon X2 3 GHz CPU and 8 GB RAM. 



SRTM 90 m 
Central Panama 
1.9 million cells 

IFSARE 30 m 
central Panama 
17 million cells 

SRTM 90 m 
all of Panama 

27 million cells 

IFSARE 10 m 
central Panama 
156 million cells 

SRTM 30 m 
all of Panama 

241 million cells 

old r.watershed , 
all in memory 

23.2min 

23 h 10 min 

30 h 46 min 

> 2 days 
not measured 

> 2 days 
not measured 

new r.watershed , 
all in memory 

0.07 min 

0.83 min 

1.05 min 

9.97 min 

12.3 min 

new r.watershed , 
data on disk 

0.15 min 

1.73 min 

2.05 min 

30.7 min 

32.18 min 

r.terraflow 

0.45 min 

4.4 min 

5.05 min 

66.95 min 

58.7 min 


Table 2. Median distances of reference points to nearest extracted 
stream in meters. 



GPS 

filling 

IRA 

LCP 

Landsat 

filling 

IRA 

LCP 

IFSARE 10 m 

62.65 

N/A* 

34.40 

43.75 

N/A* 

39.25 

IFSARE 30 m 

77.15 

71.42 

70.71 

48.88 

48.27 

47.75 

SRTM 30 m 

99.78 

92.46 

89.13 

84.25 

82.81 

83.72 

SRTM 90 m 

124.84 

119.69 

121.00 

94.30 

94.69 

94.92 


* Computation cancelled by software with out of memory error. 
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Fig. 5. Mean and standard deviation of the distance differences be¬ 
tween sink filling and least-cost path search. A positive distance dif¬ 
ference indicates that streams extracted with least-cost path search 
are closer to reference points than streams extracted with sink fill¬ 
ing. 


Fig. 6. Mean and standard deviation of the distance differences 
between impact reduction approach and least-cost path. A posi¬ 
tive distance difference indicates that streams extracted with least- 
cost path search are closer to reference points than streams extracted 
with the impact reduction approach. 


lize, hence the out-of-memory error was regarded as a lim¬ 
itation of this application and not as a hardware limitation. 
Therefore, TAS was used only for the sink treatment meth¬ 
ods in comparison with the 30 m and 90 m DEM covering 
the central Panama subregion (Fig. 2) and results of the IRA 
method were not available for 10 m IFSARE. At the time of 
writing, IRA was not included in WhiteboxGAT (Lindsay, 
2009) that has replaced TAS. 

3.2 Comparison between different methods of 
sink treatment 


Panama exceeded available system memory. TAS GIS is a 
32 bit application for Microsoft Windows and can only uti¬ 
lize as much memory as a 32 bit application can manage, 
i.e. 3 GB. As opposed to r.terraflow and r.watershed , TAS 
GIS keeps all intermediate data in memory, thus limiting the 
size of the DEMs that can be processed. The test system pro¬ 
vided as much memory as TAS GIS could theoretically uti- 


To evaluate the impact of different approaches in the han¬ 
dling of elevation surface depressions, median distances of 
reference points to extracted drainage paths for each sink 
treatment method and each processed DEM were computed 
and the results are summarized in Table 2. As expected, the 
location of extracted drainage paths improved with increas¬ 
ing detail available in the DEM. While the drainage paths for 
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LCP 


Sink filling 


IRA 


'u 

GPS field points Landsat 


(t-test, 7337 = —2.361, = 0.018) and 30 m SRTM (t-test, 

7337 = —3.088, p = 0.002), but for the 90 m SRTM the dif¬ 
ference was not significant (t-test, £337 = —1.745, p = 0.08). 
For the points digitized from Landsat, drainage paths ex¬ 
tracted from LCP accumulation were not closer than IRA 
drainage paths for any of the tested DEMs (t-tests, all ab¬ 
solute 7)94 < 1.0, all p > 0.3). 

3.3 Comparison between different levels of detail 
at constant resolution 



Fig. 7. Mean and standard deviation of the distance differences 
between the 30 m SRTM DEM and the 30 m IFSARE DEM. A pos¬ 
itive distance difference indicates that streams extracted from the 
30 m IFSARE DEM are closer to reference points than streams ex¬ 
tracted from the 30 m SRTM DEM. LCP: least-cost path search; 
IRA: impact reduction approach. 


LCP 

Sink filling 
IRA 


TD ° 

GPS field points Landsat 

Fig. 8. Mean and standard deviation of the distance differences be¬ 
tween the 90 m and the 30 m SRTM DEM. A positive distance dif¬ 
ference indicates that streams extracted from the 30 m SRTM DEM 
are closer to reference points than streams extracted from the 90 m 
SRTM DEM. LCP: least-cost path search; IRA: impact reduction 
approach. 



the 90 m SRTM DEM were about 100 m away from refer¬ 
ence points, the median distances improved down to 34.4 m 
for the IFSARE 10 m DEM. 

Drainage paths extracted from the LCP flow accumulation 
were closer to the GPS field points (t-tests, all 7337 < —5.4, all 
p < 0.0001) for all DEMs at all tested resolutions (IFSARE 
10 m, 30 m, and SRTM 30 m, 90 m) when compared with the 
sink filling method (Fig. 5 showing bar graphs with mean and 
standard deviation of the distance differences). The drainage 
paths extracted from LCP were also closer to the points dig¬ 
itized from Landsat for all DEMs (t-tests, all £994 < —2.3, 
all p < 0.0001), except for the 90 m SRTM where the dis¬ 
tance difference was not significant for (t-test, £994 = — 1.213, 
p = 0.23). 

When compared to the IRA method (Fig. 6 showing bar 
graphs with mean and standard deviation of the distance dif¬ 
ferences), drainage paths extracted from LCP flow accumula¬ 
tion were closer to the GPS field points for the 30 m IFSARE 


The 10 m IFSARE and 90 m SRTM were interpolated to the 
same resolution of 30 m. We investigated how much the dif¬ 
ferent levels of detail at identical resolution influenced the ac¬ 
curacy of extracted drainage paths for all three sink treatment 
methods (Fig. 7 showing bar graphs with mean and standard 
deviation of the distance differences). The distance between 
the reference points and drainage paths extracted from the 
30 m IFSARE DEM was subtracted from the distance be¬ 
tween the reference points and drainage paths extracted from 
the 30 m SRTM DEM to measure the difference in accuracy 
of drainage paths extracted from these two DEMs. For larger 
rivers in flat areas (Landsat points), the accuracy of drainage 
path locations was considerably higher for the 30 m IFSARE 
DEM than for the 30 m SRTM DEM (t-test, all £994 < —13.0, 
all p < 0.001; on average 35 m closer) for all methods. For 
smaller rivers not broader than 30 m in mountainous regions 
(GPS points), only IRA showed a significant improvement 
in accuracy from the 30 m SRTM DEM to the 30 m IFSARE 
DEM (t-test, 7337 = —3.315, p = 0.001, all other £337 > —1.9 
and <0, p > 0.05). However, as shown in the previous sec¬ 
tion for the 30 m IFSARE DEM, IRA drainage path locations 
in mountainous regions were significantly less accurate than 
LCP drainage path locations. 

3.4 Effects of RST resampling on the accuracy of 
drainage network extraction 

In order to assess the effect of SRTM resampling from 90 m 
to 30 m resolution by the RST method, the drainage paths 
extracted from the 90 m SRTM DEM were compared with 
drainage paths extracted from the resampled 30 m SRTM 
DEM by calculating the difference in drainage path distances 
to reference points (Fig. 8 showing bar graphs with mean and 
standard deviation of the distance differences). Correspond¬ 
ing median distances of reference points to drainage path lo¬ 
cations are given in Table 2. The accuracy of drainage path 
locations was improved by resampling for larger rivers in flat 
landscape for all sink treatment methods (t-tests, all £994 < 
—6.5, all p < 0.001), however, for GPS points in mountain¬ 
ous regions only the results obtained by the least-cost path 
search were improved (t-test, *337 = —3.734, p < 0.001, all 
other 7337 > — 1 and <0, p > 0.3). 
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Fig. 9. A typical example for the impact of sink filling along a section of a river course extracted from the 30 m SRTM DEM. Along this 
section, 92% of all elevation values were modified. Parts of the river course obscured by trees are highlighted in the GoogleEarth sreenshot. 



Fig. 10. Typical effects of different sink treatment methods on extracted stream courses in (A) mountainous areas and (B) costal plains. Filled 
sinks are grayed. Streams extracted from sink-filling in red are overlaid by streams extracted from least-cost path search in blue. Points of 
confluences (yellow cross) in mountainous areas can be shifted considerably, and in coastal plains, sink filling can lead to long straight lines. 
A shaded relief of the 10 m IF S ARE DEM was used as background. 


4 Discussion 

Digital Terrain Models, as an approximation of real terrain at 
a given level of detail, include random noise and errors. For 
example, an in depth analysis of SRTM accuracy and source 
of errors is provided by Rodriguez et al. (2006) and Farr et 
al. (2007). The errors can have significant impact on the re¬ 
sults of DEM analysis and their treatment has been the focus 
of broad research (e.g. Jenson and Domingue, 1988; Lindsay 
and Creed, 2005; Hutchinson, 2000). Sink (depression) re¬ 
moval was designed to remove small depressions introduced 
as artifacts of elevation data processing for the first genera¬ 
tion of DEMs to facilitate continuous flow routing (Jenson 
and Domingue, 1988). This procedure, referred to as hydro- 
logical conditioning, has become widespread even for next 
generations of elevation data and is now a pre-processing 


step required by a number of applications for hydrological 
analysis and modeling. In order to alleviate the problem of 
routing surface flow through the flat areas created by sink 
filling, efforts have been made to add a gradient to these flat 
areas (e.g. Martz and Garbrecht, 1998; Wang and Liu, 2006; 
Grimaldi et al., 2007; Santini et al., 2009) Nevertheless, as 
modern high resolution data such as LiDAR demonstrate, 
actual terrain includes many true depressions, especially in 
riffle-and-pool portions of mountainous rivers, and on flood- 
plains and coastal plains (Notebaert et al., 2009). There¬ 
fore, sink filling is not always an appropriate approach for 
hydrological conditioning of a DEM because the measured 
elevation data are replaced with new elevation values based 
on the rather unrealistic assumption that the terrain has no 
depressions (Jenson and Domingue, 1988; Tarboton, 1997), 
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be they artificial or real. Although many hydrologic mod¬ 
els require depression-less DEMs, removal of sinks can alter 
the true hydrological state that includes standing water in de¬ 
pressions. Moreover, in IFSARE and SRTM DEMs that in¬ 
clude vegetation cover, many of the observed depressions are 
caused by gaps in canopy, canopy closing over river courses, 
or by vegetation partially obstructing narrow and deep val¬ 
leys. The filling of such features requires altering elevations 
by several meters and over areas of hundreds of m 2 , leading 
to large flats and an artificial drainage network geometry that 
does not represent reality. Figure 9 presents an example of 
the barriers across the river channel created by overhanging 
trees and a profile that illustrates the extensive modification 
in elevation that can result from such filling. Preserving most 
of the drainage directions of the original DEM is preferable 
to hydrological conditioning in this case. 

Our results suggest that the adverse effects of sink filling 
become more pronounced for higher resolution DEMs (see 
Fig. 10 for drainage paths extracted from 10 m IFSARE). 
By contrast, at lower resolutions such as 90 m, the coarsest 
resolution tested in this study, there was often no significant 
difference in drainage path locations between the compared 
methods, particularly for Landsat reference points. Land- 
sat points where predominantly located in flood plain and 
coastal plain landscapes where both the horizontal and the 
vertical resolution of 90 m SRTM were often not sufficient to 
locate rivers and no method was able to accurately delineate 
drainage paths in these areas from 90 m SRTM (Hancock et 
al., 2006; Valeriano et al., 2006). 

When comparing SRTM and IFSARE DEMs at the same 
resolution of 30 m, the drainage networks extracted from 
the IFSARE DEM were, as expected, more accurate than 
drainage networks extracted from SRTM DEM, particularly 
for Landsat points in relatively flat areas. As shown in previ¬ 
ous studies (Hancock et al., 2006; Valeriano et al., 2006), 
the SRTM DEM close to its native resolution of 3 arc sec 
does not provide sufficient vertical detail to determine river 
courses in flat areas. The applied RST resampling method 
significantly improved the accuracy of extracted drainage 
path locations for SRTM in regions with low topography. 
Apparently the level of detail has been not just increased but 
also improved because resampling to a higher horizontal (and 
for SRTM also higher vertical) resolution seems to be help¬ 
ful in improving the results of watershed analysis from low- 
resolution DEMs (see also Valeriano et al., 2006). The global 
SRTM coverage at 3 arc sec horizontal resolution has been 
created by averaging the original 1 arc sec coverage (Farr et 
al., 2007). Depending on the method used, reinterpolating 
the 3 arc sec version back to 1 arc sec can result in narrowed 
valleys and ridges and increased channel sinuosity. Interest¬ 
ingly, only the LCP search method benefited from the reinter¬ 
polation in mountainous regions and provided here the most 
accurate drainage path locations for the 30 m SRTM DEM. 

It is important to note that the differences in the tested 
methods were significant due to the particularly challenging 


study region. Smaller differences between the methods could 
be expected in topography with gentle slopes and uniformly 
low vegetation cover. Although network connectivity, topol¬ 
ogy and channel morphometry has not been investigated, the 
visual inspection of the results indicates that there were dif¬ 
ferences between the results obtained by different methods 
and resolutions in the coastal plain areas. None of the tested 
DEMs or methods produced sufficiently reliable results and 
higher resolution did not always lead to correct topology. 
These results are similar to those found for LiDAR-based 
DEMs (Colson, 2006) and require further research. 

Although the tests presented here demonstrated the per¬ 
formance of the LCP method for radar-based data, the new, 
improved version of r.watershed has in fact been developed 
using LiDAR data provided by the USGS in addition to the 
1 m LiDAR DEM examples and data provided by Neteler and 
Mitasova (2008). The presented method processes a DEM 
without modifying its elevation values and is thus to a de¬ 
gree (quantitatively narrowed by this study) dependent on 
the reliability of the provided DEM. The LCP search ap¬ 
proach, despite its capability to route flow through depres¬ 
sions, will therefore, like other methods, fail to accurately 
extract drainage networks if the input DEM deviates too far 
from real topography. 

5 Conclusions 

We presented a method for fast computation of flow accu¬ 
mulation and drainage network extraction for large DEMs 
with nested depressions and evaluated it against other com¬ 
monly used methods for sink treatment. The accuracy assess¬ 
ment using ground control points and Landsat satellite im¬ 
agery provided insight into the accuracy of drainage network 
extraction from radar elevation data in a challenging tropi¬ 
cal forest environment and coastal plain setting. The results 
suggest that the conceptually simple sink filling approach is 
not suitable for the IFSARE and SRTM DEMs in regions 
with significant vegetation cover whereas both the tested im¬ 
pact reduction approach of Lindsay and Creed (2005) and the 
LCP search provide more accurate drainage networks. 

The performance testing has demonstrated that the new 
implementation dramatically improves computational effi¬ 
ciency while preserving the high accuracy of the LCP rout¬ 
ing capabilities. The increase in computational performance 
is particularly relevant for the modem mapping technologies 
that can rapidly produce massive DEMs at high resolutions 
for large areas and the slow processing and analysis has be¬ 
come bottleneck in their application. Fast processing of mas¬ 
sive DEMs opens their application to a new level of detail and 
spatial extent, for example in rapid response operations or for 
mapping in remote regions where the streams are covered by 
dense vegetation and no reliable stream data are available. 
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Abstract —We address the percolation-based connectivity of 
large-scale ad hoc heterogeneous wireless networks, where sec¬ 
ondary users exploit channels temporarily unused by primary 
users and the existence of a communication link between two sec¬ 
ondary users depends on not only the distance between them but 
also the transmitting and receiving activities of nearby primary 
users. We introduce the concept of connectivity region defined as 
the set of density pairs—the density of secondary users and the 
density of primary transmitters — under which the secondary 
network is connected. Using theories and techniques from con¬ 
tinuum percolation, we analytically characterize the connectivity 
region of the secondary network and reveal the tradeoff be¬ 
tween proximity (the number of neighbors) and the occurrence 
of spectrum opportunities. Specifically, we establish three basic 
properties of the connectivity region—contiguity, monotonicity of 
the boundary and uniqueness of the infinite connected component, 
where the uniqueness implies the occurrence of a phase transition 
phenomenon in terms of the almost sure existence of either zero 
or one infinite connected component; we identify and analyze 
two critical densities which jointly specify the profile as well as 
an outer bound on the connectivity region; we study the im¬ 
pacts of secondary users’ transmission power on the connectivity 
region and the conditional average degree of a secondary user 
and demonstrate that matching the interference ranges of the 
primary and the secondary networks maximizes the tolerance of 
the secondary network to the primary traffic load. Furthermore, 
we establish a necessary condition and a sufficient condition for 
connectivity, which lead to an outer bound and an inner bound on 
the connectivity region. 

Index Terms —Cognitive radio, connectivity region, continuum 
percolation, critical densities, heterogeneous wireless network, 
phase transition. 


I. Introduction 

T HE communication infrastructure is becoming increas¬ 
ingly heterogeneous, with a dynamic composition of in¬ 
terdependent, interactive and hierarchical network components 
with different priorities and service requirements. One example 
is the cognitive radio technology [1] for opportunistic spectrum 
access which adopts a hierarchical structure for resource sharing 
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[2] . Specifically, a secondary network is overlaid with a pri¬ 
mary network, where secondary users identify and exploit tem¬ 
porarily and locally unused channels without causing unaccept¬ 
able interference to primary users [2]. 

A. Connectivity and Connectivity Region 

While the connectivity of homogeneous ad hoc networks con¬ 
sisting of peer users has been well studied (see, for example, 

[3] —[10]), little is known about the connectivity of heteroge¬ 
neous networks. The problem is fundamentally different from 
its counterpart in homogeneous networks. In particular, the con¬ 
nectivity of the low-priority network component depends on the 
characteristics (traffic pattern/load, topology, interference toler¬ 
ance, etc.) of the high-priority component, thus creating a much 
more diverse and complex design space. 

Due to the hierarchical structure of spectrum sharing, a com¬ 
munication link exists between two secondary users if the fol¬ 
lowing two conditions hold: (Cl) they are within each other’s 
transmission range; (C2) they see a spectrum opportunity de¬ 
termined by the transmitting and receiving activities of nearby 
primary users (see Section II-B-I). We say that a topological link 
exists between two secondary users if only condition (Cl) holds. 

Using theories and techniques from continuum percolation, 
we analytically characterize the percolation-based connectivity 
of the secondary network in a large-scale ad hoc heterogeneous 
network. Specifically, we consider a Poisson distributed sec¬ 
ondary network overlaid with a Poisson distributed primary 
network in an infinite two-dimensional Euclidean space 1 . We 
define network connectivity as the existence of an infinite 
connected component formed by communication links almost 
surely (a.s.), i.e., the occurrence of percolation. This connec¬ 
tivity characterizes the accessibility of two secondary users at 
a given instant and is thus also called instantaneous mutual 
connectivity. Given the transmission power and the interfer¬ 
ence tolerance of both the primary and the secondary users, the 
connectivity of the secondary network depends on the density 
of secondary users [due to (Cl)] and the traffic load of primary 
users [due to (C2)]. 

We thus introduce the concept of connectivity region C, de¬ 
fined as the set of density pairs (As, Apt ) under which the sec¬ 
ondary network is connected, where As denotes the density of 
the secondary users and A pt the density of primary transmitters 
(representing the traffic load of the primary users). As illustrated 
in Fig. 1, a secondary network with a density pair (As, Apt) in¬ 
side the shaded region is connected: the secondary network has 

'This infinite network model is equivalent in distribution to the limit of a 
sequence of finite networks with a fixed density as the area of the network in¬ 
creases to infinity, i.e., the so-called extended network [11]. It follows from the 
arguments similar to the ones used in [12, Chapter 3] for homogeneous ad hoc 
networks that this infinite ad hoc heterogeneous network model represents the 
limiting behavior of large-scale networks. 


55 



Fig. 1. Connectivity region C (the upper boundary Ap T ( A s ) is defined as the 
supremum density of the primary transmitters that ensures connectivity with a 
fixed density of the secondary users; the critical density AJ of the secondary 
users is defined as the infimum density of the secondary users that ensures con¬ 
nectivity under a positive density of the primary transmitters; the critical density 
Ap T of the primary transmitters is the supremum density of the primary trans¬ 
mitters that ensures connectivity with a finite density of the secondary users). 

a giant connected component which includes infinite number of 
secondary users. The existence of the giant connected compo¬ 
nent enables bidirectional communications between distant sec¬ 
ondary users via multihop relaying. On the other hand, a sec¬ 
ondary network with a density pair (As, App) outside this re¬ 
gion is not connected: the network is separated into an infinite 
number of finite connected components. Consequently, any sec¬ 
ondary user can only communicate with users within a limited 
range. 

It is shown in [13] and [14] that when the primary network is 
static, this instantaneous connectivity is equivalent to the con¬ 
nectivity defined in terms of the finiteness of the minimum mul¬ 
tihop delay (referred to as fd-connectivity); when the primary 
traffic has temporal dynamics (no matter how small the range 
of the dynamics is), this instantaneous connectivity is stronger 
than the fd-connectivity and the secondary network can be in¬ 
termittently connected as long as it is topologically connected 
[13], [14]. 

The objective of this paper is to establish analytical charac¬ 
terizations of the connectivity region and to study the impact of 
system design parameters (in particular, the transmission power 
of the secondary users) on network connectivity. The main re¬ 
sults are summarized in the subsequent two subsections. 

B. Analytical Characterizations of the Connectivity Region 

We first establish three basic properties of the connectivity re¬ 
gion: contiguity, monotonicity of the boundary and uniqueness 
of the infinite connected component. Specifically, based on a 
coupling argument, we show that the connectivity region is a 
contiguous area bounded below by the As-axis and bounded 
above by a monotonically increasing function X PT (Xs) (see 
Fig. 1), where the upper boundary X PT (Xs) is defined as 

Ap r (As) = sup{App : (/(As, Xpt) is connected} (1) 

with (/(As, Xpt) denoting the secondary network of density As 
overlaid with a primary network specified by the density Xpt 
of the primary transmitters. The uniqueness of the infinite con¬ 
nected component is established based on ergodic theory and 
certain combinatorial results. It shows that once the secondary 


network is connected, it contains a unique infinite connected 
component a.s. 

Second, we identify and analyze two critical parameters of 
the connectivity region: X* s and X PT . They jointly specify the 
profile as well as an outer bound on the connectivity region. Re¬ 
ferred to as the critical density of the secondary users, A^ is the 
infimum density of the secondary users that ensures connectivity 
under a positive density of the primary transmitters 

X't; = inf{As : 3A pt > 0 s.t. (/(As, Xpt) is connected}. 

( 2 ) 

We show that X* s equals the critical density A c of a homogeneous 
ad hoc network (i.e., in the absence of primary users), which 
has been well studied [15]. This result shows that the “takeoff’ 
point in the connectivity region is completely determined by the 
effect of proximity—the number of neighbors (nodes within the 
transmission range of a secondary user). 

Referred to as the critical density of the primary transmitters, 
Ap r is the supremum density of the primary transmitters that 
ensures the connectivity of the secondary network with a finite 
density of the secondary users: 

App = sup{App : 3Xs < 00 s.t. (/(As, Xpt) is connected}. 

_ (3) 

We obtain an upper bound on App, which is shown to be achiev¬ 
able in simulations. More importantly, this result shows that 
when the density of the primary transmitters is higher than the 
(finite) value given by this upper bound, the secondary network 
cannot be connected no matter how dense it is. This parameter 
App thus characterizes the impact of opportunity occurrence 
on the connectivity of the secondary network: when the den¬ 
sity of the primary transmitters is beyond a certain level, there 
are simply not enough spectrum opportunities for any secondary 
network to be connected. 

Since a precise characterization of the upper boundary 
App (As) of the connectivity region is intractable, we establish 
a necessary and a sufficient condition for connectivity to pro¬ 
vide an outer and an inner bound on the connectivity region. 
The necessary condition is expressed in the form of the condi¬ 
tional average degree of a secondary user and is derived by the 
construction of a branching process. The sufficient condition 
is obtained by the discretization of the continuum percolation 
model into a dependent site percolation model. 

C. Impact of Transmission Power on Connectivity: Proximity 
Versus Opportunity 

The study of the impact of the secondary users’ transmis¬ 
sion power p tx on network connectivity reveals an interesting 
tradeoff between proximity and opportunity in the design of 
heterogeneous networks. As illustrated in Fig. 2, we show 
that increasing p tx enlarges the connectivity region C along 
the As-axis (i.e., better proximity leads to a smaller “takeoff’ 
point), but at the price of reducing C along the App-axis. 
Specifically, with a large p tx , few secondary users experience 
spectrum opportunities due to their large interference footprint 
with respect to the primary users. This leads to a poor tolerance 
to the primary traffic load parameterized by App. 

The transmission power p tx of the secondary network should 
thus be chosen according to the operating point of the hetero- 
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Fig. 2. Simulated connectivity regions for two different transmission powers 
( p tx denotes the transmission power of the secondary users and the large p tx is 
3“ times the small p tx , where a is the path-loss exponent). 

geneous network given by the density of the secondary users 
and the traffic load of the co-existing primary users. Using the 
tolerance to the primary traffic load as the performance mea¬ 
sure, we show that the interference range rj of the secondary 
users should be equal to the interference range Rp of the pri¬ 
mary users in order to maximize the upper bound on the critical 
density \* PT of the primary transmitters. Given the interference 
tolerance of the primary and secondary users, we can then de¬ 
sign the optimal transmission power p tx of the secondary users 
based on that of the primary users. 

D. Related Work 

To our best knowledge, the connectivity of large-scale ad hoc 
heterogeneous networks has not been characterized analytically 
or experimentally in the literature. There are a number of classic 
results on the connectivity of homogeneous ad hoc networks. 
For example, it has been shown that to ensure either 1-connec¬ 
tivity (there exists a path between any pair of nodes) [3], [4] 
or ^-connectivity (there exist at least k node-disjoint paths be¬ 
tween any pair of nodes) [5], the average number of neighbors 
of each node must increase with the network size. On the other 
hand, to maintain a weaker connectivity—p-connectivity (i.e., 
the probability that any pair of nodes is connected is at least p ), 
the average number of neighbors is only required to be above a 
certain “magic number’ which does not depend on the network 
size [ 6 ]. 

The theory of continuum percolation has been used by 
Dousse et al. in analyzing the connectivity of a homogeneous 
ad hoc network under the worst-case mutual interference [7], 
[ 8 ]. In [9], [10], the connectivity and the transmission delay in 
a homogeneous ad hoc network with static or dynamic on-off 
links are investigated from a percolation-based perspective. 

In [13] and [14], the impact of temporal dynamics of primary 
traffic on the connectivity and multihop delay of the secondary 
network is studied. In particular, it is shown that in the presence 
of primary traffic dynamics, the secondary network is connected 


(either instantaneously or intermittently) as long as it is topolog¬ 
ically connected. Optimal power control in heterogeneous net¬ 
works has been studied in [16], which focuses on a single pair of 
secondary users in a Poisson network of primary users. The im¬ 
pacts of secondary users’ transmission power on the occurrence 
of spectrum opportunities and the reliability of opportunity de¬ 
tection are analytically characterized. 

E. Organization and Notations 

The rest of this paper is organized as follows. Section II 
presents the Poisson model of the heterogeneous network. In 
particular, the conditions for the existence of a communication 
link in the secondary network is specified based on a rigorous 
definition of spectrum opportunity. In Section III, we introduce 
the concept of connectivity region and establish its three basic 
properties. The two critical densities are analyzed, followed 
by a necessary and a sufficient condition for connectivity. In 
Section IV, we demonstrate the tradeoff between proximity 
and opportunity by studying the impact of the secondary users’ 
transmission power on the connectivity region and on the 
conditional degree of a secondary user. Further, we obtain the 
optimal transmission power of the secondary users, adopting 
the secondary network’s tolerance of the primary traffic load as 
the performance measure. Section V contains detailed proofs 
of the main results and Section VI concludes the paper. 

Throughout the paper, we use capital letters for parameters 
of the primary users and lowercase letters for those of the sec¬ 
ondary users. 

II. Network Model 

We consider a Poisson distributed secondary network over¬ 
laid with a Poisson distributed primary network in an infinite 
two-dimensional Euclidean space. The models of the primary 
and secondary networks are specified in the following two sub¬ 
sections. 

A. The Primary Network 

The primary transmitters are distributed according to a two- 
dimensional Poisson point process with density App• Each pri¬ 
mary transmitter’s receiver is uniformly distributed within its 
transmission range R p . Here we have assumed that all primary 
transmitters use the same transmission power and the trans¬ 
mitted signals undergo isotropic path loss. Based on the dis¬ 
placement theorem [17, Chapter 5], it is easy to see that the pri¬ 
mary receivers form a two-dimensional Poisson point process 
with density App. Note that the two Poisson processes formed 
by the primary transmitters and receivers are correlated. 

B. The Secondary Network 

The secondary users are distributed according to a two-di¬ 
mensional Poisson point process with density A 5 , independent 
of the Poisson processes of the primary transmitters and re¬ 
ceivers. The transmission range of the secondary users is de¬ 
noted by r p . 

Communication Links: In contrast to the case in a homoge¬ 
neous network, the existence of a communication link between 
two secondary users depends on not only the distance between 
them but also the availability of the communication channel 
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Fig. 3. Definition of spectrum opportunity. 


(i.e., the presence of a spectrum opportunity). The latter is deter¬ 
mined by the transmitting and receiving activities in the primary 
network as described below. 

As illustrated in Fig. 3, there exists an opportunity from A, 
the secondary transmitter, to B , the secondary receiver, if the 
transmission from A does not interfere with nearby primary re¬ 
ceivers in the solid circle and the reception at B is not affected 
by nearby primary transmitters in the dashed circle [18]. Re¬ 
ferred to as the interference range of the secondary users, the 
radius 77 of the solid circle at A depends on the transmission 
power of A and the interference tolerance of the primary re¬ 
ceivers, whereas the radius Rj of the dashed circle (the interfer¬ 
ence range of the primary users 2 ) depends on the transmission 
power of the primary users and the interference tolerance of B. 

It is clear from the above discussion that spectrum opportuni¬ 
ties depend on both transmitting and receiving activities of the 
primary users. Furthermore, spectrum opportunities are a,sym¬ 
metric. Specifically, a channel that is an opportunity when A 
is the transmitter and B the receiver may not be an opportu¬ 
nity when B is the transmitter and A the receiver. In other 
words, there exist unidirectional communication links in the 
secondary network. Since unidirectional links are difficult to uti¬ 
lize in wireless networks [19], we only consider bidirectional 
links in the secondary network when we define connectivity. As 
a consequence, when we determine whether there exists a com¬ 
munication link between two secondary users, we need to check 
the existence of spectrum opportunities in both directions. 

To summarize, under the disk signal propagation and inter¬ 
ference model, there is a (bidirectional) link between A and B 
if and only if (Cl) the distance between A and B is at most r p \ 
(C2) there exists a bidirectional spectrum opportunity between 
A and B , i.e., there are no primary transmitters within distance 
Rj of either A or B and no primary receivers within distance rj 
of either A or B. 

Connectivity: We interpret the connectivity of the secondary 
network in the percolation sense: the secondary network is con¬ 
nected if there exists an infinite connected component a.s. 

Based on the above conditions (Cl, C2) for the existence of a 
communication link, we can obtain an undirected random graph 
(/(As, Apr) corresponding to the secondary network, which 
is determined by three Poisson point processes: the secondary 

2 Since the minimum transmission power for successful reception is, in gen¬ 
eral, higher than the maximum allowable interference power, it follows that the 
transmission range R p of primary users is smaller than R r . Furthermore, under 
the disk signal propagation and interference model, we have R p — fiRr (0 < 
(3 < 1). A similar relationship holds for r p and rj. 



■ Primary Tx 
Primary Rx 
* Secondary User 


Fig. 4. Realization of the heterogeneous network. The random graph 
Q{ As, Ap T ) consists of all the secondary nodes and all the bidirectional links 
denoted by solid lines. The solid circles with radii Ri denote the interference 
regions of the primary transmitters within which secondary users cannot 
successfully receive and the dashed circles with radii r T denote the required 
protection regions for the primary receivers within which the secondary users 
should refrain from transmitting. 


users with density A 5 , the primary transmitters with density 
A pt and the primary receivers with density A pt (correlated 
with the process of the primary transmitters) 3 . See Fig. 4 for an 
illustration of (/(A 5 , A pp). 

The question we aim to answer in this paper is: when is the 
secondary network connected, i.e., when does percolation occur 
in (/(As, Apr)? 


III. Analytical Characterizations of the 
Connectivity Region 

Given the transmission power and the interference tolerance 
of both the primary and the secondary users (i.e., R p , Rj, r p 
and r 1 are fixed), the connectivity of the secondary network is 
determined by the density As of the secondary users and the 
density A pt of the primary transmitters. We thus introduce the 
concept of connectivity region C of a secondary network, which 
is defined as the set of density pairs (As, A pt) under which the 
secondary network (/(As, A pt) is connected (see Fig. 1) 

C = {(As, A pt) : (/(As,App) is connected.}. 


A. Basic Properties of the Connectivity Region 

We next establish three basic properties of the connectivity 
region. 

Theorem 1: Basic Properties of the Connectivity Region. 
Tl.l The connectivity region C is contiguous, that is, for 
any two points (Asi, Appi), (Xs 2 ^PT 2 ) £ C, there exists 
a continuous path in C connecting the two points. 

T1.2 The lower boundary of the connectivity region C is 
the As-axis. Let Ap T (As) denote the upper boundary of 
the connectivity region C defined by (1), then we have that 
Ap r (As) is monotonically increasing with As- 

3 The two Poisson point processes of the primary transmitters and receivers 
are essentially a snap shot of the realizations of the primary transmitters and 
receivers. In different time slots, different sets of primary users become active 
transmitters/receivers. Thus, even if a secondary user is isolated at one time due 
to the absence of spectrum opportunities, it may experience an opportunity at a 
different time and be connected to other secondary users. 
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Fig. 5. Two realizations of the Poisson heterogeneous network (black stars denote primary transmitters, green plus signs denote primary receivers, red dots denote 
secondary users and blue segments denote the bidirectional links between secondary users). We have removed secondary users who do not see opportunities for 
clarity. The simulation parameters are given by R p — 50 m, R T — 80 m, \s — 650 km -2 , r p — 50 m, rj = 80 m and the critical density in this case is 
A c (50) « 576 km -2 (a) percolation occurs (A PT = 10 km -2 ) (b) percolation does not occur (A PT — 20 km -2 ). 


T1.3 There exists either zero or one infinite connected com¬ 
ponent in Q(\s , A pt) a.s. 

Proof: The proofs of Tl.l and T1.2 are based on the 
coupling argument, a technique frequently used in continuum 
percolation [15, Section 2.2]. The proof of T 1.3 is based on 
the ergodicity of the random model driven by the three Poisson 
point processes of the primary transmitters, the primary re¬ 
ceivers and the secondary users. The details of the proofs are 
given in Section V-B. ■ 

Tl.l and T 1.2 specify the basic structure of the connectivity 
region, as illustrated in Fig. 1. T1.3 implies the occurrence of a 
phase transition phenomenon, that is, there exists either a unique 
infinite connected component a.s. or no infinite connected com¬ 
ponent a.s. This uniqueness of the infinite connected component 
excludes the undesirable possibility of having more than one 
(maybe infinite) infinite connected component in the secondary 
network. We point out that such a property is not always present 
in wireless networks. Two examples where more than one infi¬ 
nite connected component exists in a homogeneous ad hoc net¬ 
work can be found in [ 20 ]. 

B. Critical Densities 

In this subsection, we study the critical density A* s of the sec¬ 
ondary users and the critical density A PT of the primary trans¬ 
mitters defined by (2) and (3), respectively. We have the fol¬ 
lowing theorem. 

Theorem 2: Critical Densities. 

Given R p , Rj , r p and rj, we have: 

T2.1 A^ = A c (r p ), where A c (r p ) is the critical density for 
a homogeneous ad hoc network with transmission range r p 
(i.e., in the absence of the primary network). 

T2.2 

T* W _ (A\ 

PT — 4max{i£|,r|} — r p 

where the constant A c (l) is the critical density for a homo¬ 
geneous ad hoc network with a unit transmission range. 


Proof: The basic idea of the proof of T2.1 is to approximate 
the secondary network Q(A 5 , Apt) by a discrete edge-percola¬ 
tion model on the grid. This discretization technique is often 
used to convert a continuum percolation model to a discrete 
site/edge percolation model (see, for example, [15, Chapter 3], 
[ 8 ]). The details of the proof are given in Section V-C-I. 

The proof of T2.2 is based on the argument that if there is an 
infinite connected component in the secondary network, then an 
infinite vacant component must exist in the two Poisson Boolean 
models driven by the primary transmitters and the primary re¬ 
ceivers, respectively. The key point is to carefully choose the 
radii of the two Poisson Boolean models in order to obtain a 
valid upper bound on \* PT . The details of the proof can be found 
in Section V-C-II. ■ 

Fig. 5(a) shows one realization of the heterogeneous network 
when As is slightly larger than A c (r p ) and Apt is small. At least 
one left-to-right (L-R) crossing and at least one top-to-bottom 
(T-B) crossing can be found in the square network. It is thus 
expected that these L-R and T-B crossings in finite square re¬ 
gions can form an infinite connected component in the whole 
network on U 2 . If we slightly increase Apt , then we observe 
from Fig. 5(b) that the reduction in spectrum opportunities elim¬ 
inates numerous communication links in the secondary network, 
creating several disjoint small components. 

Fig. 6 shows a simulation example of the connectivity region, 
where the upper bound on the critical density A PT of the pri¬ 
mary transmitters given in T2.2 appears to be achievable. 

C. A Necessary Condition for Connectivity 

In this subsection, we establish a necessary condition for con¬ 
nectivity which is given in terms of the average conditional de¬ 
gree of a secondary user. This condition agrees with our intu¬ 
ition: the secondary network cannot be connected if the degree 
of every secondary user is small. 

Let i (A, <i, rx/tx) denote the event that there exist primary re¬ 
ceivers/transmitters within distance d of a secondary user A. Let 
II(A, <i, rx/tx) denote the complement of D(A, <i, rx/tx). Since a 
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Fig. 6. Simulated connectivity regions when r p — 150 m, rj = 240 m, 
R p — 100 m and R T — 120 m. The blue dashed line is the upper bound (4) on 
the critical density X* PT of primary transmitters given in T2.2. The area of the 
simulated heterogeneous network is 2000m x 2000 m. For a fixed density A s 
of the secondary users, the upper boundary X PT (\s) is equal to the minimum 
density of the primary transmitters such that over all the 1000 realizations, the 
percentage of the ones in which there exists at least one L-R crossing is below 
50%. The intuitive reason for choosing the existence of an L-R crossing as the 
criterion for connectivity is illustrated in Fig. 5 and is discussed in Section V-C. 

secondary user is isolated if it does not see a spectrum opportu¬ 
nity, we focus on secondary users who experience spectrum op¬ 
portunities and define the conditional average degree p of such 
a secondary user A as 

/1 = E[deg(A)\ I (A, rj, rx) n I (A, Rp tx)] (5) 

where deg (A) denotes the degree of A, rj the interference range 
of the secondary users and Rj the interference range of the pri¬ 
mary users. Notice that the degree of A is the number of sec¬ 
ondary users within the transmission range of A and experi¬ 
encing bidirectional opportunities with A. We arrive at the fol¬ 
lowing necessary condition for connectivity. 

Theorem 3: A necessary condition for the connectivity of 
&(\s, Xpp) is p > 1> where p is the conditional average degree 
of a secondary user defined in (5). 

Proof: The basic idea is to construct a branching process, 
where the conditional average degree p is the average number of 
offspring. This branching process provides an upper bound on 
the number of secondary users in a connected component. If p < 
1, then the branching process is finite a.s. It thus follows that 
there is no infinite connected component a.s. in G(Xs, Xpp). 
Details can be found in Section V-D. ■ 

To apply the necessary condition given in Theorem 3, the 
conditional average degree p of a secondary user A needs to 
be evaluated based on the network parameters. Let B be a sec¬ 
ondary user randomly and uniformly distributed within trans¬ 
mission range r p of A. Let g{Xpp , r p , 77 , R pi Ri) denote the 
probability of a bidirectional opportunity between A and B con¬ 
ditioned on the event that A sees an opportunity. Based on the 
statistical equivalence and independence of different points in 
a Poisson point process, the conditional average degree p of a 
secondary user A is given by this conditional probability gf) of 
a bidirectional opportunity between A and a randomly chosen 


neighbor multiplied by the average number of neighbors of A, 
i.e., 

p = (\ s irrl) ■ g(XpT, r p , r 7 , Rp, i? 7 ). ( 6 ) 

The detailed derivation of ( 6 ) and an expression for g(-) are 
given in Appendix A. It is also shown in Appendix A that g(-) is 
a strictly decreasing function of Xpp. Thus $ -1 (*), the inverse 
of gf) with respect to A pp, is well-defined. 

Combining ( 6 ) with Theorem 3, we obtain an outer bound on 
the connectivity region. Specifically, let p(X 5 , Xpp) denote the 
conditional average degree of a secondary user in Q(\ 5 , Xpp). 
Then those density pairs (As, Xpp) satisfying p(X 5 , Xpp) < 1 
are outside the connectivity region. 

Corollary 1: Given R p , Rp, r p and 77 , an outer bound on the 
connectivity region C is given by 

Apt= 5-1 GAi) 

where g _1 (-) is the inverse of the conditional probability gf) 
with respect to Xpp. 

D. A Sufficient Condition for Connectivity 

In this subsection, we establish a sufficient condition for con¬ 
nectivity, which provides an inner bound on the connectivity re¬ 
gion and a criterion for checking whether a secondary network 
is connected. 

Our sufficient condition for connectivity is established by 
using the discretization technique. The continuum percolation 
model is mapped onto a dependent site-percolation model C 
in the following way. As illustrated in Fig. 7, we partition U 2 
into (dashed) squares with side length d s and locate a site at the 
center of each square. Sites whose associated dashed squares 
share at least one common point are considered connected (as 
illustrated by solid lines in Fig. 7). Thus, each site is connected 
to eight neighbors 4 (see the eight neighbors Oi ,..., Os of site 
O in Fig. 7). Let Bo be the associated dashed square of O, then 
O is occupied if there exists in Bo at least one secondary user 
who sees an opportunity. 

Since the largest distance between two points in two neigh¬ 
boring dashed squares is 2\f2d s , it follows that if we set d s = 
then every pair of secondary users in two neighboring 
dashed squares are within transmission range r p of each other. 
Based on the definitions of occupied site in C and communi¬ 
cation link in the secondary network, we conclude that the ex¬ 
istence of an infinite occupied component (a connected compo¬ 
nent consisting of only occupied sites) in C implies the existence 
of an infinite connected component in the secondary network. 

Due to the fact that spectrum opportunities are spatially de¬ 
pendent, the state of one site is correlated with the states of its 
adjacent sites. Thus, the above site-percolation model £ is a de¬ 
pendent model. Define the dependence range k as the minimum 
distance such that the state of any two sites at distance larger 
than k are independent, where the distance between two sites 

4 For the commonly used square site-percolation model, each site has four 
neighbors. The site-percolation model constructed here provides a better inner 
bound. 
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Fig. 7. Illustration of the dependent site-percolation model C with side length 
d s (solid dots denote sites, solid lines denote edges connecting every two sites 
and dashed lines denote the squared partition). 


is the minimum number of neighboring sites that must be tra¬ 
versed from one site to the other. Then the dependence range of 
C is given by 



8max |Ri + p Xi + fr p 




Let p c denote the upper critical probability of C which is de¬ 
fined as the minimum probability of occupancy p* such that if 
the probability of occupancy p > p *, an infinite occupied com¬ 
ponent containing the origin exists in C with a positive prob¬ 
ability (wpp.). Since the dependence range k of C is finite, it 
follows from [12, Theorem 2.3.1] thatp c < 1. Now we present 
a sufficient condition for connectivity in the following theorem. 

Theorem 4: Let p c denote the upper critical probability of the 
dependent site-percolation model C specified above. Define 

Z(r, R p , n) = 2 ^ 2 ’ ~ dt (») 


where 5j(£, 77 ) is the common area of two circles with radii 

R p and 77 and centers t apart. Then the secondary network is 
connected if the equation shown at the bottom of the page holds. 


Corollary 2: A sufficient condition for the connectivity of 
£/(Asr, Apr) is 

1 1 — exp (— ) 

PT< IT [Rj+ ^-1(11!, R p ,n)\ !_ (i) (2fc+1 ) 2 

where /(i?p, R p , 77 ) is defined in (8) and k is the dependence 
range of the site-percolation model defined in (7). 

IV. Impact of Transmission Power: Proximity 

Versus Opportunity 

In this section, we study the impact of the secondary users’ 
transmission power on the connectivity and the conditional av¬ 
erage degree of the secondary network. As has been illustrated 
in Fig. 2, there exists a tradeoff between proximity and oppor¬ 
tunity in designing the secondary users’ transmission power for 
connectivity. Specifically, increasing the transmission power of 
the secondary users leads to a smaller critical density A^ of the 
secondary users, but at the same time, a lower tolerance to the 
primary traffic load manifested by a smaller critical density \* PT 
of the primary transmitters. 

A. Impact on the Conditional Average Degree 

As discussed in Section III-C, the expression for the condi¬ 
tional average degree p can be decomposed into the product 
of two terms: A s^r p and g(\pT-> t p , 77 , R p , Ri). The first 
term is the average number of neighbors of a secondary 
user, which increases with the transmission power p tx of the 
secondary users (i.e., enhanced proximity). The other term 
#(App, r p , 77 , R pi Rj) is the conditional probability of a bidi¬ 
rectional opportunity, which decreases with p tx due to reduced 
spectrum opportunities. This tension between proximity and 
opportunity is illustrated in Fig. 8, where we observe that 
the impact of p tx on proximity dominates when p tx is small 
(p increases with p tx ) while its impact on the occurrence of 
opportunities dominates when p tx is large (p decreases with 

Ptx)' 

Corollary 3: Let p tx be the transmission power of secondary 
users and p the conditional average degree defined in (5), then 
under the disk signal propagation and interference model we 
have 5 


Proof: The proof is based on the ergodicity of the hetero¬ 
geneous network model and its relation with the constructed 
dependent site-percolation model C. Details can be found in 
Section V-E. ■ 

By applying a general upper bound on the upper critical prob¬ 
ability p c for a site-percolation model with finite dependence 
range [12, Theorem 2.3.1], we arrive at the following corollary. 


li = 0 ((ptx) 2/a ) as p t x —► 00 

where a is the path-loss exponent. 

Proof: We show this corollary by deriving an upper bound 
on p. Details can be found in Appendix B. ■ 

5 Here we use the Big O notation: fix) — 0{g{x)) as x — » 00 if and only if 
3 M > 0, xq >0 such that \f(x)\ < M\g{x)\ for all x > , {•(). 


1 — exp 




8 


exp {-X pt tt [Rj + rj - I (i?j, R p , r/)] } > p c . 
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Fig. 8. Conditional average degree p of secondary users vs transmission range 

r p of secondary users. r p oc (p tx ) « , where p tx is the transmission power of 
secondary users and a is the path-loss exponent and simulation parameters are 
given by Ap T = 2.5 kin -2 , R p — 200 m, Ri — 250 m, A^ = 25 km -2 , 
n = r p / 0.8. 

For a homogeneous network, the average degree of a user is 
XnPp, which increases with p tx at rate ( p tx ) 2 ^ a . In sharp con¬ 
trast, this corollary tells us that for a heterogeneous network, 
when p tx is large enough, the conditional average degree p of 
a secondary user actually decreases with p tx at least as fast as 

o ptx)~ 2/a ■ 

B. Impact on the Connectivity Region 

From the scaling relation of the critical density [15, Propo¬ 
sition 2.11], we know that in a homogeneous two-dimensional 
network 


/ ^c(fp) A c (l) (xp) ^ ijPtx) a 

where the constant A c (l) is the critical density for a homoge¬ 
neous ad hoc network with a unit transmission range. Thus, 
if each secondary user adopts a high transmission power, then 
X c (r p ) decreases. It follows from T2.1 that the critical den¬ 
sity of secondary users required to achieve connectivity de¬ 
creases due to the enhanced proximity. 

On the other hand, from the upper bound on the critical den¬ 
sity A p T of the primary transmitters given in T2.2, we have that 

Ap r = O ((ptx)~ 2/a ) as p tx ^>oo 



Fig. 9. Example of the upper bound on X* PT as a function of ri (parameters 
are given by Ri — 120 m, r p — 0.625rj). 

leads to the “largest” connectivity region (largest in the sense 
that its connectivity region contains all regions achievable with 
any finite transmission power p tx of the secondary users). Thus, 
the optimal design of p tx depends on the operating point of the 
heterogeneous network. For instance, when a sparse secondary 
network is overlaid with a primary network with low traffic load, 
a large p tx may be desirable to achieve connectivity. The oppo¬ 
site holds when a dense secondary network is overlaid with a 
primary network with high traffic load. 

Focusing on a secondary network whose density exceeds the 
critical density of the secondary users, we address the design 
of its transmission power for the maximum tolerance to the pri¬ 
mary traffic. Due to its tractability and achievability, the upper 
bound on the critical density X PT of the primary transmitters 
given in T2.2 is used as the performance measure (see Fig. 6 for 
a simulation example). 

Theorem 5: Let rj and Rj denote the interference range of 
the secondary and the primary users. For a fixed Rj , the upper 
bound on X PT given in T2.2 is maximized when the primary and 
secondary networks have matching interference ranges: rj = 
Rj. 

Proof: Since under the disk signal propagation and inter¬ 
ference model, r p = firi for some (3 £ (0,1), the upper bound 
on X PT can be written as 


A 


* 

PT 


< 


Ac (1) 

4f?2_/32 r 2 


A c (1) 

(4-/3 2 )rj 


for ri < Rj 
for rj > Rj. 


where we have assumed that r p = f3rj for some (3 £ (0,1) 
under the disk signal propagation and interference model. Thus, 
when the transmission power p tx of the secondary network is 
large enough, the critical density X PT of the primary transmit¬ 
ters decreases with p tx at least as fast as {ptx)~ 2 ^ due to re¬ 
duced spectrum opportunities. 

C. Optimal Design of Transmission Power 

Due to the tension between proximity and opportunity, there 
does not exist a transmission power of the secondary users that 


Then the above theorem can be readily shown by finding the 
maximal point for the two cases: rj < Rj and rj > Rj. ■ 

An example of the upper bound on X PT is plotted as a func¬ 
tion of rj in Fig. 9. Notice that there is a distinct difference 
in the slope on the two sides of the optimal point. As a conse¬ 
quence, the operating region of rj < Rj is preferred over that of 
rj > Rj when the optimal point rj = Rj cannot be achieved. 
We point out that the desired operating region of rj < Rj is the 
typical case of a secondary network coexisting with a privileged 
primary network. 
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V. Proofs 

In this section, we present proofs of the main results presented 
in Sections III-IV. We start with a brief overview of several 
basic results in percolation theory that will be used in the proofs. 


A. Percolation Theory: Some Basic Results 

Poisson Boolean Model: Poisson Boolean model [15], often 
referred to as B(X< p, A), is specified by two elements: a 
Poisson point process X on U d with density A and a radius p, 
a random variable with a given distribution. Under this model, 
each point in X is the center of a ball in U d with a random 
radius distributed according to the distribution of p. Radii asso¬ 
ciated with different points are independent and they are also 
independent of points in X. Under a Poisson Boolean model, 
the whole space is partitioned into two regions: the occupied 
region, which is the region covered by at least one ball and the 
vacant region, which is the complement of the occupied region. 
We define occupied (vacant) components as those connected 
components in the occupied (vacant) region. 

Assume that nodes in a homogeneous ad hoc network form 
a Poisson point process with density A and their transmission 
range is r. It is easy to see that the connectivity of this network 
can be studied through examination of the occupied connected 
components in the corresponding Poisson Boolean model 
B(X, r/2, A). 

Sharp Transition in Two Dimensions: For the Poisson 
Boolean model in two dimensions, the sharp transition phe¬ 
nomenon is remarkable in the sense that the critical density 
for the a.s. existence of infinite occupied components is equal 
to that for the a.s. existence of infinite vacant components. 
Let A c (2p) denote the critical density for the Poisson Boolean 
model B(X, p , A), then we have that: 

• when A < A c (2p), there is no infinite occupied component 
a.s. and there is a unique infinite vacant component a.s.; 

• when A > A c (2p), there is a unique infinite occupied com¬ 
ponent a.s. and there is no infinite vacant component a.s. 

The exact value of A c is not known. For a deterministic radius p, 
simulation results [21] indicate that A c (2 p) 0.36p 2 , while 
rigorous bounds 0.192p -2 < A c (2 p) < 0.843 p~ 2 are provided 
in [15] and [22]. 

Crossing Probabilities: A continuous curve in the occupied 
region is called an occupied path. An occupied path 7 is an oc¬ 
cupied L-R crossing of the rectangle {0 < x < h} x {0 < 
V < ^ 2 } if 7 intersects both the left and the right boundaries 
of the rectangle, i.e., 7 n ({x = 0 } x {0 < y < 1%}) 7 ^ c/>, 
7 n {{x = h} x {0 < y < I 2 }) 7 ^ (j> and the segment between 
the two intersecting points is fully contained in the rectangle. 
Similarly, we define an occupied T-B crossing by requiring that 
7 intersects with the top and bottom boundaries of the rectangle. 
Let 


a((h,l 2 ), A, L-R) 

= Pr{3 an occupied L — R crossing of [0, ?i] x [M 2 ]} 
<x((M 2 ), A, T-B) 

= Pr{3 an occupied T — B crossing of [0, 1 1 ] x [0, Z 2 ]} 


A pp f 


X pp ^ 


(W25 A pp 2 ) 

# 




(XspXpTl) {Xs 2 ,XpTl) 


(Asi, Xppi) 


{^S2, Apri) 
- <D 


{Xs2i XpT2) 
-=>- 


(a) A pti 7 Xpp -2 A« 


(b) Xppi > Xpp2 A< 


Fig. 10 . Continuous path connecting the two points (A S i,Apti) 
and (A 52 , Apt 2 ) in the connectivity region C. (a) A PT i < A PT 2 - 

(b) A PTI > A PT2 ■ 


denote the two crossing probabilities in the rectangle [0, Zi] x 
[0, 12 ]. Then for a Poisson Boolean model B(X, p, A) in two 
dimensions with a.s. bounded p, we have [15, Corollary 4.1] 
that for any k > 1 


lim a((kn,n), A, L — R) 

n —> co 


1, if A > A c (2p) 

0, if A < X c (2p). K > 


Due to the symmetry of the Poisson Boolean model, 
similar results hold for the T-B crossing probability 

cr((n 1 kn ), A, T — B). 


B. Proof of Theorem 1 

Proof of T1.1: To prove Tl.l, it suffices to show that for any 
two given points (Asi, A pti) and (As 2 5 A PT 2 ) inC, we can find 
a path in C that connects these two points. In particular, the path 
we constructed is given by a horizontal segment and a vertical 
segment as shown in Fig. 10, where we assume, without loss of 
generality, that Asi < As 2 - 

Consider case (a) in Fig. 10 where A pti < A pt 2 • Case 
(b) can be proven similarly. First, we show every point 
(As, Xpti) (Asi < As < As2) on the horizontal segment be¬ 
longs to C. Let A' = As — Asi- A Poisson point process X with 
density As is statistically equivalent to the superposition of a 
Poisson point process X\ with density Asi and an independent 
Poisson point process X' with density A'. It follows that any 
realization of the heterogeneous network with densities As and 
Xpti can be generated by adding more secondary nodes to a 
realization of the heterogeneous network with densities Asi and 
A pti. Thus, the existence of an infinite connected component 
in (/(Asi, A pti) implies the existence of an infinite connected 
component in (/(As, Appi). We thus have that (As, Apn) G C 
for (Asi < As < As 2 )- 

Now we know that the two end points (As 2 ,Appi) and 
(As 25 Xpp 2 ) of the vertical segment belong to C. For a point 
(As 2 5 App) (Appi < A pp < Xpp 2 ) on the vertical segment, 
let A ; = Xpp 2 — Xpt , then any realization of the heteroge¬ 
neous network with densities As 2 and A pp can be obtained 
by independently removing each primary transmitter-receiver 
pair with probability X'/Xpt 2 from a realization of the het¬ 
erogeneous network with densities Xs 2 and Xpp 2 - It follows 
from the definition of communication link in the secondary 
network (see Section II-B-I) that the existence of an infinite 
connected component in (/(As 2 ? Xpp 2 ) implies the existence 
of an infinite connected component in (/(As,App). Thus, we 
have (As 2 , Xpt) G C (Apn < A pp < A pt2)- 
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Proof of Theorem 1.2: Suppose that (As, Xpt ) £ C (A pt > 
0), then by using the coupling argument for showing that the ver¬ 
tical segment belongs to C in the above proof of T 1.1, we con¬ 
clude that (As, 0) £ C, i.e., the As-axis is the lower boundary 
of C. 

Suppose that As 2 > Asi > 0. In order to prove the mono¬ 
tonicity of Ap T (As) with As it suffices to show that V A pp > 0, 
if (Asi, Xpp) £ C then (As2 ? Apr) £ C. This is a direct conse¬ 
quence of the coupling argument for showing that the horizontal 
segment belongs to C in the above proof of T 1.1. 

Proof of Theorem 1.3: We first establish the ergodicity 6 of 
the heterogeneous network model. 

Lemma 1: The heterogeneous network model is ergodic. 
Proof of Lemma 1: The proof of this lemma is inspired 
by the proof of the ergodicity of the Poisson Boolean model 
[15, Proposition 2.8]. The difficulty here is that for the hetero¬ 
geneous network model, we have two correlated Poisson point 
processes: the primary transmitters and the primary receivers. 
The definition of the shift transformation for the primary net¬ 
work model is thus more complicated than the standard Poisson 
Boolean model with a deterministic radius p. To prove Lemma 
1, we first show the ergodicity of the primary network model 
and then we show the mixing property 7 of the secondary net¬ 
work model. Since the primary network model is independent 
of the secondary network model, it follows from [24, Theorem 

2.6.1] that the heterogeneous network model is ergodic. 
d . 

Let 33 denote the Borel cr-algebra in R d and N the set of all 

d 

simple counting measures 8 on 33 . Construct a cr-algebra Jf 
for N generated by sets of the form 

{n £ N : n(A) = k} 

d 

where A £ 33 and k is an integer. A point process X can now 
be defined as a measurable mapping from a probability space 
(O, X , P) into (N, , » ) [23, Chapter 7]. The measure fi on 
pL induced by X is defined as p,(G) = P(X~ 1 (G)), for all 

g £ jr. 

In order to define the shift transformation on O, it is conve¬ 
nient to identify (0, X ) with (A r , pV). Let uj( A) denote the 

d 

number of points in A £ ^ , V ^ £ and T x be the shift 
according to a vector x £ R d . Then T x induces a shift transfor¬ 
mation S x : ft —► 0 through the following equation for every 

d 

Aef : 

(S' x w)(A) = lo(T~ 1 A). (10) 

6 A measure preserving (m.p.) dynamical system (O, X , /u, T) consists of 

a set fi, a a -algebra X of measurable subsets of fi, a nonnegative measure 

(i on (SI, X ) and an invertible m.p. transformation T : fi —* fi such that 

p(T~ 1 F) — fi(F) f° r all F £ X . A set F E X is said to be T-invariant if 

T~ 1 F — F. The m.p. dynamical system (O, X pi , T) is said to be ergodic 
if the cr-algebra of T-invariant sets is trivial, i.e., for any invariant set, either it 
has measure 0 or its complement has measure 0. 

7 An m.p. dynamical system (fi, X , //, T) is said to be mixing if for all 

E, F e X , lim fj,(T n E n F) = p(E)p(F). 

n - y oo 

d 

8 A simple counting measure on 33 is an integer-valued measure for which 
the measures of bounded Borel sets are all finite and the measure of a point is 
at most 1. 


Let ( ftpp , X pt i Ppt ) be the probability space of 
the Poisson point process Xpt for the primary transmit¬ 
ters with density A pt- Let ftpp be the product space 

rineN EUz 2 C R P for the P r ™ary receivers, where 
Cp p ~ {(x,y) : x 2 + y 2 < R p }. Then we equip ftpp with 
the usual product cr-algebra and product measure Ppr with all 
marginal probability measure being pp, where (ijj is a uniform 
probability measure on Cp . Finally, we set ftp = ft pt xftpp 
and equip ftp with the product measure Pp = Pp T x Ppr and 
the usual product cr-algebra. It follows that the primary network 
model is a measurable mapping from ftp into Npp x ftpp 
defined by (c opp-, ppr) — ^(^pt(ppt)^ ppr ), where Npp is 
specified in the definition of the point process. 

The positions of the primary transmitters corresponding to 
(c opp, ppr) £ ftpp x ftpp are easily known from uopp. 
For the primary receivers, the positions are obtained as follows. 
Consider binary cubes 

2 

K(n , z) := (zi + l) 2 -n ] for all n £ N and z £ Z 2 . 

i =1 

For each primary transmitter xpp, there exists a unique smallest 
integer no = no(xpp) such that it is contained in a binary cube 
K(n o, z(n o, Xpp)) which contains no other primary transmit¬ 
ters. The relative position of xpp’s receiver with respect to xpp 
is then given by ppr(tio^ z(no,xpp)). 

Let ei, 62 denote the unit vectors in R 2 , then the translation 
T ei : [R 2 —^ [R 2 (z = 1,2) defined by x —► x + ep induces a shift 
transformation U e% on ft pp through the equation 

(< U ei uj PR ){n , z) = Ppp(n , - 2 n e ?: ), for i = 1,2. 

Hence T e . also induces a shift transformation T e . on ftp = 
ftpp x ftpp as follows: 

T e% (pp) = ( S ei cupp , U ei Ppp ), for i = 1,2 

where S e% is defined in (10). By using techniques similar to the 
proof of Boolean models [15, Proposition 2.8], we have that the 
m.p. dynamical system (ftp, X p, Pp , T ei ) is ergodic. 

Since the transmission range r p of secondary users is fixed, 
the probability space of the secondary network model is the 
probability space (Op, X 5 , Pp) for the Poisson point process 
Xs of secondary users with density Ap. It follows from the 
proof of Poisson point processes [15, Proposition 2.6] that the 
m.p. dynamical system (Op, X p, Pp, S ei ) is mixing. 

Since the primary network model is independent of the sec¬ 
ondary network model, the sample space of the heterogeneous 
network model O can be written as the product of Op and Op, 
i.e., O = Op x Op. We equip O with product measure P = 

Pp x Pp and the usual product cr-algebra. Similarly, the trans- 

/\ 

lation T e . (i = 1,2) induces a transformation T e . on ft = 
Op x Op, which is given by 

PM = {fe^P,S et U S ). 

Then it follows from [24, Theorem 2.6.1] that the product m.p. 

/\ 

dynamical system (O, X , P, T e f) is ergodic. Since the cr-al- 
gebra invariant under the transformation group {T z : z £ Z 2 } 
is a subset of the cr-algebra invariant under the transformation 
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T ei , we conclude that {T z : z E Z 2 } acts ergodically, i.e., the 
heterogeneous network model is ergodic. ■ 

Based on Lemma 1, we have the following lemma. 

Lemma 2: The number of infinite connected component in 
(/(As, A pt) is a constant a.s. and it can only take value from 
{0, 1, oo}. 

Proof of Lemma 2: Let K denote the (random) number of 
infinite connected components in (/(As, A pp), then since for all 
k > 0, the event {K = k} is invariant under the group of shift 
transformations, it follows from Lemma 1 and the definition of 
ergodicity that Pt{K = k} = 0 or 1. Consequently, we have 
that K is a.s. constant. Then it suffices to exclude the possibility 
of K > 2. This is shown by contradiction, that is, if there exist 
K > 2 infinite connected components, then they can be linked 
together as one connected component wpp. The proof is inspired 
by the proof of Proposition 3.3 in [15] and a major difference is 
that here we need to consider the impact of the primary network 
on the connectivity of the secondary network. 

Suppose that there are K > 2 infinite connected components 
a.s. If we remove all the secondary nodes centered inside a box 
B = [—n,n] 2 , then the resulting secondary network should 
contain at least K unbounded components a.s. Let, for A C U 2 , 
Q [A] denote the graph formed by secondary nodes in A. Given 
a box B and e > 0, consider the event E(B,e) shown at the 
bottom of the page. 

Partition the box B into squares with side length a > 0 
and let S a = {5i,..., 5j\r} denote the collection of all the 
squares which are adjacent to the boundary of B. Clearly, for 
a box B and e > 0, we can find a = a(B,e) E (0,r p /\/5) 
and rf = rf(a) > 0 such that for any point x ^ B with 
d(x,B) < r p — e/2, there exists a square S = S(x) E $ a 
for which we have sup yG s d(x,y) < r p — 77 . This means that, 
if we center in each square of S a a secondary node and there 
are neither primary transmitters nor primary receivers within a 
bigger box B = [—n — max{rj, itp}, n + max{r/, Bp}] 2 , then 
every infinite component U in Q[B C ] with d(U , B) < r p — e is 
connected to some secondary node in S a . 

Let E(a,rf) be the event that each square in S a contains at 
least one secondary node and E(B) the event that there are nei¬ 
ther primary transmitters nor primary receivers within B. Since 
E(a, rj) depends on the configuration of secondary nodes inside 
the box B , E(B^e) depends on the configuration of secondary 
nodes outside B and the configuration of primary nodes, based 
on the independence of the primary network and the secondary 
network, we have 


Pr (E(B, e) H E(a , rf) H E(B)) 

= Pr (E(B, e))Pr(E(a, rf))Pr(E(B)\E(B, e)). 


If E(B , e), E(a , rf) and E(B) all occur wpp., then there is 
only one infinite connected component 9 wpp. By using argu¬ 
ments similar to the proof for [15, Proposition 3.3], we have 
that there exists a large enough box B and e > 0 such that 
Py(E(B< e)) > 0. Obviously, Pi(E(a 1 rf)) > 0. Moreover, it 
is easy to see that P(E(B)\E(B, e)) > P(E(B)) >0. ■ 

Now we have that the number K of infinite connected com¬ 
ponents is equal to zero, one or infinity a.s. To exclude the pos¬ 
sibility of K = 00, we can use the proof of [15, Theorem 3.6] 
for Poisson Boolean models, which is based on several combi¬ 
natorial results. The details are omitted. 

C. Proof of Theorem 2 

Proof of T2.1: To prove T2.1, it suffices to show that: 

(a) for any As < A c (r p ), the secondary network is not con¬ 
nected for any A pt > 0; 

(b) for any As > A c (r p ), there exists a Ap r (As) > 0 
such that V A pt < Ap r (As), the secondary network is 
connected. 

From Section IV-A, we know that for a Poisson homogeneous 
ad hoc network with density A and transmission range r, the nec¬ 
essary and sufficient condition for connectivity is A > A c (r). 
Since the existence of an infinite connected component in the 
secondary network implies the existence of an infinite connected 
component in the homogeneous ad hoc network with the same 
density and the same transmission range, by using a coupling 
argument, we conclude that when A^ < A c (r p ), there does not 
exist an infinite connected component a.s. in the secondary net¬ 
work for any A p T > 0. This proves part (a). 

The basic idea of the proof of part (b) is to approximate the 
secondary network (/(As, A pt) by a discrete dependent edge- 
percolation model on the grid. This discrete dependent edge- 
percolation model C is constructed in a way such that the exis¬ 
tence of an infinite connected component in C implies the exis¬ 
tence of an infinite connected component in (/(As, App). 

Construct the square lattice C on U 2 with side length d e . Note 
that each site in C is virtual and is not related to any node either 
in the secondary network or in the primary network. Next we 
specify the conditions for an edge being open in £, which is the 
key to the mapping from (/(As, App) to C. 

For each edge e in C , let ( x e , y e ) denote the middle point of e. 
Then we introduce three random fields A e , B e and C e , all asso¬ 
ciated with the edge e in £, where C e = A e B e is the indicator 
of the edge e being open, A e represents the condition (Cl) of 
the distance between two users for the existence of a commu¬ 
nication link in the secondary network and B e represents the 
condition (C2) of the spectrum opportunity. Specifically, con¬ 
sider the Poisson Boolean model B(Xs , r p / 2, As) where Xs 

9 Since a < r p /\fb, every secondary node in a square of S a is connected to 
those secondary nodes in the neighboring squares. 


E(B,e) := {d(U,B) < r p 


e for any infinite connected component U in Q[B C ] \ 
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Fig. 11. Realization where A e = 1 for the edge e (hollow points are sites in 
£ and solid segments are edges in £). 



is the Poisson point process generated by secondary users, then 
for a horizontal edge e, A e = 1 if the following two events (il¬ 
lustrated in Fig. 11) are true: 

(i) there is an occupied L-R crossing of the rectangle 
\x e 3rf e /4, x e + 3c£ e/ /4] x [z/ e c? ey /4, y e 4- d e /A\ in 
B{Xs, r p /2, A 5 ); 

(ii) there are two occupied T-B crossings of the square [x e — 

3cZ e /4, x e — d e /4] x[y e — d e / 4, y e + d e /4] and the square 
\x e + d e f 4, x e + 3<i e /4] x [z/ e d e f 4, y e 4- d£ e /4] in 

B(Xs , r p / 2 , As); and A e = 0 otherwise. For a vertical 
edge e, the definition of A e is similar, where the horizontal 
and vertical coordinates are switched in the above two 
events. 

Next, we define the random field B e . For an edge e in £, 
B e = 1 if A e = 1 and the following two events are true: 

(i) there is no primary transmitter within distance Rj of any 
secondary node of the three crossings in the definition of 

A e ; 

(ii) there is no primary receiver within distance 77 of any sec¬ 
ondary node of the three crossings in the definition of A e ; 

and B e = 0 otherwise. It follows from the definition of com¬ 
munication link in the secondary network (see Section II-B-I) 
that if A e = 1 and B e = 1, then the three crossings in 
B{Xs , r p [ 2, As) are also three crossings in (/(As, Apr). 

Let C e = A e B e , then we claim that the edge e is open if 
C e = 1 and e is closed if C e = 0. We observe from Fig. 11 
that whether the edge e is open is correlated with the states of 
the other edges. This model C thus is a dependent edge-perco¬ 
lation model. Furthermore, as shown in Fig. 12, if there exists 
an infinite open connected component in C , then those cross¬ 
ings associated with the edges in the infinite component in C 
comprise an infinite connected component in (/(As, Apr). As a 
consequence, by considering the uniqueness of the infinite con¬ 
nected component in (/(As, Apr), we only need to prove the 
following lemma in order to show T2.1. 

Lemma 3: Let C(O) denote the open connected component 
containing the origin O in C. Then given As > A c (r p ),3 D > 
0, A p T > 0 such that for d e = D and any Xp T < Ap T , we have 

Pr{|C(0)| = 00} > 0 

where |C(0)| is the number of edges in C(O). 


Fig. 12. Percolation in £ (thick segments are open edges in £ and thin seg¬ 
ments are closed edges in £ and blue curves are those crossings associated with 
the open edges). 


Proof of Lemma 3: For an arbitrary edge e in £, let q = 
Pr{C e = 0}, then we have 

q = Pr{(A e = 0)U (B e = 0)} < Pr{A e = 0}+Pr{B e = 0}. 


From the result on the crossing probabilities given in (9), we 
know that when As > A c (r p ). 


Pr{A e 


0} =Pr{at least one crossing does not exist} 



As, L - Rj 
, As, T - b) 
, As, T - b) 


0 as d 


oo, z.e., 


d , 


lim Pr{A ( 


oo 


0 } = 0 . 


Thus, when As > A c (r p ), Ve > 0, 3 D > 0 such that Pr{A e = 
0} < | for d e = 0. 

Given A e = 1, let Sr t be the area of the region covered by 
the circles with radii Rj centered at those secondary nodes in 
the three crossings and S ri be the area of the region covered by 
the circles with radii r/ centered at those secondary nodes in the 
three crossings. Then we have 


Pr {B e = 0 | A e = 1} 

= Pr{3 some primary transmitter in Sr t } 

+ Pr{3 some primary receiver in S ri }. 

Since Sr t < (^r + 2Rj + r p ) (^ + 2 Rj + r p ) and 

S rj < (^r + 2 rj + r p ) (^f + 2 rj + r p ), it follows from 

the basic property of Poisson point processes that 


Pr {B e = 0 
< 1 — exp 

+ 1 — exp 


A e = 1} 

—A pt rp ^ r P^j 

+ 2 n + r p ) 


-A 


3 d f 


p T 


+ 2 rj + r 
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Obviously, lim Pr {B e = 0 | A e = 1} = 0 for fixed d e 

A pt —^ 0 

Thus if we choose d e = D , then Ve > 0, 3 X PT > 0 such that 


Pr {B e = 0 | A e = 1} < - for all Apr < \ PT . 

3 

It implies that when d e = D, for all App < X PT 


Pr {B e = 0} = Pr{ A ( 
< Pr{A f 


= 0}+Pr{F? e =0 | A e ~ l}Pr{A f 
— 0} + Pr {B e = 0 | A e = 1} 


= 1 } 


< 


2e 


Thus, for d e = D and all App < X PT , we have 


& 


PrLmary TTu^-sin itcei 
- Secondary Usa 

Fig. 13. Counterexample for choosing Rpp — Ri • All the secondary nodes 
in the infinite path are located outside those circles centered at the primary 
transmitters with radii Rp, which form a series of rings surrounding the origin 
O and there is no infinite vacant component in the Poisson Boolean model 
B{Xp T j Ri , Ap T ) driven by the primary transmitters. 


El 

® # a i ■ ■ i 

R 


q < Pr{A e = 0} + Pr {B e = 0} < e. (11) 


From Fig. 11, we can see that if d e > ma.x{4 Rj + 2 r p , 4 rj + 
2 r p }, then the state of edge e is only correlated with its six ad¬ 
jacent edges and it is independent of other edges. In this case, 
by using the “Peierls argument” [25, Chapter 1], we can show 

that if the probability of an edge being closed q < ( 11 ~ 2 _ V ^Q ) , 


then 



Fig. 14. Infinite path in the secondary network. The dashed segments form an 
inner bound on the infinite vacant component in the Poisson Boolean model 
driven by the primary receivers. 


Pr{|C( 0 )| = 00} > 0 . (12) 

The proof of the above statement follows the one of [7, Theorem 
3] except that the upper bound on the probability of n edges all 
being closed is given by the following fact. 

Fact 1: [8, Proposition 1]: For any collection {ei}™ =1 of n 
distinct edges in £, we have 

Tt 

Pr{(Ci = 0) n (C 2 = 0) n • • • n (c n = o)} < q * 

where Ci is the indicator of e* being closed, and q is the prob¬ 
ability of an edge being closed. 

Thus, by combining (12) with (11), we conclude that given 
Xs > X c (r p ), 3 D > 0, X PT > 0 such that for fixed d e = 
max{D, 4 Rj + 2 r p , 4 n + 2 r p } and any A pp < X PT 

Pr{|C( 0 )| = 00} > 0 . 

Notice that X PT depends on D which is chosen according to the 
crossing probability and is determined by A5. As a consequence, 
X PT is a function of A 5 , i.e., X PT = X PT (X 5 ). ■ 

Proof of T2.2: From the conditions for the existence of 
a communication link in the secondary network specified in 
Section II-B-I, we know that for every secondary node in an in¬ 
finite connected component, there can exist neither any primary 
transmitter within distance Rj of it nor any primary receiver 
within distance rj of it. In other words, every secondary node 
in an infinite connected component must be located outside 
all the circles centered at the primary transmitters and the 
primary receivers with radii Rj and 77 , respectively. Thus, 
if there is an infinite connected component in the secondary 
network, then an infinite vacant component must exist in 
the two Poisson Boolean models B(X P t, Rpti Xpp) and 
t3(XpR, i?pp, A pp) driven by the primary transmitters and 
the primary receivers, respectively. Here i?pp and Rpr are 


some appropriate radii which will be specified later. A natural 
choice for R PT is i?p, but if we consider the counterexample 
given in Fig. 13, then we can clearly see that even if there is an 
infinite path in the secondary network, no infinite vacant com¬ 
ponent exists in the Poisson Boolean model S(Xpp, i?p, A pp) 
driven by the primary transmitters. Similarly, counterexamples 
can be easily constructed for choosing Rpr = 77 . 

Suppose there is an infinite connected component in the sec¬ 
ondary network. Then we can find a sequence of secondary 
users {Si, $ 2 , £ 3 ,...} suc h that they comprise an infinite path 
starting from Si (see Fig. 14). 

Assume that Si and S$+i {i > 1) are two adjacent secondary 
nodes in the above infinite path. Notice that the distance di, i+ 1 
between Si and S ^ + 1 satisfies ^ 7+1 < r p < r/, where the 
second inequality r p < 77 follows from the fact that the min¬ 
imum transmission power for successful reception is in general 
higher than the maximum allowable interference power. 

As we know, all the primary receivers must be outside the 
two circles with radii 77 centered at Si and S^+i, respectively, 
as shown in Fi§. 15. Given e > 0, consider the rectangle 

— di ^ +1 ? fxd± x e ] between Si and Si+ 1 . By a 

simple computation in geometry, we have that the minimum 
distance from all the primary receivers to the rectangle is 

\J r j ~ — e. As illustrated in Fig. 14, it implies that there 
exists an infinite vacant component in the Poisson Boolean 

model B ^App, \J r j ~ ~f ~ € > ^pt^J driven by the primary 

receivers 10 . By recalling the known results in Section V-A-II, 
we thus conclude that for all e > 0 

Apr < ( 2 ^/rj — Tp/4: - e) A c (l). 

10 This technique used here can also be applied to the case when r p > r J? 
where only the minimum distance from all the primary receiver to the bar be¬ 
tween Si and Si + i needs to be recomputed. 
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Fig. 15. One edge (S * 3 Si+i) in the infinite path. 


Let e —* 0, then it yields 


A 




The other term AR 2 _ 2 A c (l) in the upper bound is obtained 

by applying the same argument to the Poisson Boolean model 
driven by the primary transmitters. 


D. Proof of Theorem 3 

Consider the connected component Ca containing 
an arbitrarily chosen secondary user A. Assuming that 
| Ca | > 1, we construct a branching process as follows. 

Notice that if | Ca \ > 1, then A must see an opportunity, i.e., 

II(A, 77 , rx) fl l(A, R/,t * i) ii) * * * * * * * x ) is true. Call A the initial point (or 
0-th generation) of the branching process. Then the children 
of A (i.e., the first generation of the branching process) are 
secondary users which satisfy the following two conditions: 

i) it is within distance r p of A, where r p is the transmission 
range of secondary users; 

ii) there exist neither any primary receiver within distance r/ 
of the secondary user nor any primary transmitter within 
distance Rj of the secondary user. 

The n-th (n > 2) generation of the branching process is ob¬ 
tained similarly and these children are connected to their par¬ 

ents in the (n — l)-th generation of the branching process via 
bidirectional links. Obviously, all the secondary users in Ca are 

counted in the constructed branching process model. But some 

of them may probably be counted more than once, since we do 
not exclude the previous n generations (including generation 
0) when we consider the n-th generation. Thus, this branching 
process gives us an upper bound on the number of secondary 
users in Ca • It follows that if the branching process does not 

grow to infinity wpp., then there does not exist an infinite con¬ 
nected component a.s. in Q(A 5 , Apr), due to the stationarity 

of the heterogeneous network model. Since the conditional av¬ 
erage degree p is the average number of offspring for every gen¬ 
eration, the necessary condition follows immediately from the 
classic theorem for branching processes 11 [1, Theorem 2.1.1]. 

n The lower bound on the conditional average degree can be improved by 
constructing a multi-type branching process. Specifically, when we consider the 
children of a secondary user, we categorize them into different types based on 
their distances from the secondary user. A similar construction has been used 
to obtain the lower bound on the critical density of Poisson Boolean models [1, 

Sec. 3.9]. This multi-type branching process is, however, difficult to analyze and 
requires substantial further investigation. 


E. Proof of Theorem 4 

From the construction of the dependent site-percolation 
model C , we know that the existence of an infinite occupied 
component in C implies the existence of an infinite connected 
component in (/(As, Apt)- Then in order to obtain a sufficient 
condition for the connectivity of the secondary network, it 
suffices to find a sufficient condition for the existence of an 
infinite occupied component in C. 

Let p be the probability that one site is occupied. Then based 
on the definition of the upper critical probability p c of £, we 
have that if p > p c , an infinite occupied component containing 
the origin exists in C wpp. It implies that if p > p c , there exists 
an infinite connected component in the secondary network wpp. 
Since the event that there exists an infinite connected component 
in the secondary network is invariant under the group of shift 
transformations, it follows from the ergodicity of the heteroge¬ 
neous network model (see Lemma 1) that if p > p c , there exists 
an infinite connected component in the secondary network a.s. 

Let Fk(Bo) denote the event that there exist exactly k sec¬ 
ondary users in the associated dashed square Bo of a site O and 
let Gi(Bo) (1 < i < k) denote the event that the secondary 
user Ai sees an opportunity. Then based on the definition of oc¬ 
cupied site in £, we obtain a lower bound on p as follows: 

00 

p= Y^MFk(Bo)}Pv {uUGi(Bo)} 

k = 1 

OO 

> ^Pr{F fe (B 0 )}Pr{G 1 (B 0 )} 

fc = 1 

OO 

= Pr{G 1 (B 0 )}^Pr{Pfc(£?o)} 

fc= 1 

= [1 - exp(-Ase^)]Pr jl (Ai,r/,rx) n 1 (Ai,i2/,tx)| 

exp{- Apttt [R 2 I +r 2 I -I(R I ,R pj r I )]}. 

In the last step, Pr |ll (Ai, 77 , rx) n 0 (Ai, i?j, tx) j has been 
obtained by setting the distance d = 0 in the expression for the 
probability of an unidirectional opportunity between two sec¬ 
ondary users with distance d apart given in [16, Proposition 1]. 

VI. Conclusion and Future Directions 

We have studied the connectivity of a large-scale ad hoc het¬ 
erogeneous wireless network in terms of the occurrence of the 
percolation phenomenon. We have introduced the concept of 
connectivity region to specify the dependency of connectivity 
on the density of the secondary users and the traffic load of 
the primary users. We have shown several basic properties of 
the connectivity region: the contiguity of the region, the mono¬ 
tonicity of the boundary and the uniqueness of the infinite con¬ 
nected component. We have analytically characterized the crit¬ 
ical density of the secondary users and the critical density of 
the primary transmitters; they jointly specify the profile of the 
connectivity region. We have also established a necessary and a 
sufficient condition for connectivity, which give an outer and an 


















68 





Rp,t,r T ) 


Fig. 16. Illustration of Si (t, r i, r 2 ) (the common area of two circles with radii 
r i and r 2 and centers f apart), <Sf/ 2 (t , ri, r 2 ) (the union area of two circles with 
radii r 1 and r 2 and cents t apart) and 5'/ 2 (r, 0 , i? p , f, r r ) (the intersection area 
between one circle with radius R p and the union of the two identical circles 
with radii r T ). 


Proposition 1: Let A s and A pt be the density of secondary 
users and primary transmitters. Let rj and Rj be the interfer¬ 
ence range of the secondary and primary users and r p and R p 
the transmission range of the secondary and primary users. Let 
g(^PT>r p ,ri, R p , Ri) denote the probability of a bidirectional 
opportunity between A and B conditioned on the event that A 
sees an opportunity. Then the conditional average degree g, of a 
secondary user is given by 


p = (A 5 7rrp) • < 7 (Ap r ,r p ,r/,i?,p,i?/) 


= XsTTT 




>2 1 I r 

-^exp < - Ap r 7r(rf + Rj + I(Ri, R p , r/)) 

o r P y 

— Si(t, ri, ri) — Si(t, Ri, Rj) 

S I2 (r,9,R p ,t,r I ) 


Su2 (t,Rj ,Rj) 


7tR^ 


rdrdO 


d t 


(Al) 


where 

I(Ri, R P , rj) = 2 £ T 


inner bound, respectively, on the connectivity region. Further¬ 
more, by examining the impacts of the secondary users’ trans¬ 
mission power on the connectivity region and on the conditional 
average degree of a secondary user, we have demonstrated the 
tradeoff between proximity and spectrum opportunity. In es¬ 
tablishing these results, we have used techniques and theories 
in continuum percolation, including the coupling argument, er- 
godic theory, the discretization technique, and an approximation 
using a branching process. 

To highlight the interaction between the primary users and the 
secondary users in a heterogeneous network, we have ignored 
the fading effect and the mutual interference between secondary 
users. If we take into account these factors, then the received 
signal to interference-plus-noise ratios at two secondary users 
will replace the distance between them in condition (Cl) for the 
existence of a communication link between them. This will re¬ 
sult in a random connection model (RCM) with correlated links, 
where the correlation between links is due to the mutual inter¬ 
ference and condition (C2) on the presence of the bidirectional 
opportunity. There will be a tradeoff between proximity and mu¬ 
tual interference in addition to the tradeoff between proximity 
and opportunity and the combination of these two tradeoffs will 
complicate the characterization of the connectivity of the sec¬ 
ondary network. But since the RCM shares several basic prop¬ 
erties (e.g., ergodicity and existence of the critical density) with 
the Boolean model used in this paper, we expect that most of the 
results established here (e.g., connectivity region, critical den¬ 
sities) can be extended to the RCM. 

Appendix A 

Expression for Conditional Average Degree 


Si(t, ri, 77 ) the common area of two circles with radii 77 and 
77 and centers t apart [see Fig. 16(a)] and 77 , 77 ) is the 

union of two circles with radii 77 and 77 and centers t apart [see 
Fig. 16(b)]. Sj 2 (r, 9, R p , t, 77 ) is the intersection area between 
one circle with radius R p and the union of the two circles with 
both radii rj [see Fig. 16(c)]. For Sj 2 (r, 9, R p ,t,ri), the two 
identical circles are centered t apart and the other circle is cen¬ 
tered at (r, 9 ), where the middle point of the centers of the two 
identical circles is chosen to be the origin O. 

Expressions for I(Rj, R p ,rj) and Sj{t,ri,r 2 ) can be ob¬ 
tained in explicit form, which can be found in [1, Appendix A]. 
The expression for Sj 2 (r, 9 , R p , t , 77 ) depends on the expres¬ 
sion for the common area of three circles which is tedious and 
is given in [26]. By applying the basic property of the exponen¬ 
tial function to (Al), we can easily show that g(-) is a strictly 
decreasing function of A pt- 

Proof: Let Fk (A) denote the event that there exist exactly k 
neighbors of a secondary user A. We thus have the first equation 
shown at the bottom of the next page. 

When k = 0, it is obvious that deg ( A) = 0. When k > 0, let 
Bj be a neighbor of A and 1 Bi an indicator function for Bj such 
that 1 Bi = 1 if D(F?j, 77 , rx) n II (Bi, Rj , tx) occurs and 1 Bi — 
0 otherwise. Then by considering the statistical independence 
and equivalence of the k secondary users, we have the second 
equation shown at the bottom of the next page. It follows that 
we get (A2), as shown at the bottom of the next page. 

According to the definition of spectrum opportunity, 
Pr{D(A, rj, rx) n II(A, it*/, tx)} can be obtained by setting 
the distance d = 0 in the expression for the probability of a 
unidirectional opportunity between two secondary users with 
distance d apart given in Proposition 1 in [16]. 


An expression for the conditional average degree p of a sec¬ 
ondary user is presented in the following proposition. 


Pr{D(A, r/, rx) n l(A, Rj , tx)} 

= exp[-Ap T 7r(r| + Rj - I(Ri, R p , r/))] (A3) 
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Next, we derive the expression for the probability 
of a bidirectional opportunity, i.e., Pr{l(i?i, 77, rx) H 
D(Bi, Rj, tx)nll(A, ri 7 rx) n D(A, /?,/, tx)}, which depends 
on the location of B\ only through its distance to A. Since B\ 
is uniformly distributed within distance r p of A , the density 
function of the distance t between B 1 and A is given by M 

r p 

for 0 < t < r p . In this case, the probability of a bidirectional 
opportunity can be written as shown in (A 4 ), also shown at 
the bottom of the page, where the integrand can be written as 
shown in (A 5 ), at the bottom of the page. 


Next, we compute the two probabilities in (A 5 ) one by one. 
Since the primary receivers admit a Poisson point process with 
density A pt, we have 


Pr{D(5i, 77 , rx) n D(A, r/,rx)| d(B 1 , A) = t} 

= exp[-Ap T (27rr| - r 7 , rj))] (A 6 ) 

where Sj(t, 77 , 77) is the common area of two circles with both 
radii 77 and centers t apart [see Fig. 16 (a)]. 


li = E [deg (A) | I (A, 77, rx) n D (A, i?j, tx)] 

= Efc[E[de^(A) | I (A, 77, rx) n l(A,i 2 j,tx) H F fc (A)]] 

00 (A 2\ fc 

= £ e-^——E^ec^A) | l(A,n,rx) n i(A,R T ,tx) n F k (A)] 


k =0 


E [deg(A) | D(A, 77, rx) n II(A, i?j, tx) n Ffc(A)] 

k 

= ^2 E [1 Bi | l(A, r/, rx) n l(A, i?/, tx)] 

i =1 

= fcE[lsi | D(A, 77, rx) n l(A, tx)] 

= fePr{D(Bi, 77, rx) n D(F?i, i?j, tx) | If A, 77, rx) n I(A, Ri , tx)} 
^Pr{I(}Bi, 77, rx) D D(E?i, i?/, tx) H D(A, 77, rx) D I (A, i?j, tx)} 

Pr{D(A, 77, rx) H D(A, it* */, tx)} 


fi = As7rr 


2 Pr{I(iii, 77, rx) H D(F>i, it*/, tx) D I(A, 77, rx) H D(A, it*/,tx)} 

p 


Pr{D(A, 77, rx) H D(A, it*/, tx)} 


(A 2 ) 


Pr{D(Bi, 77, rx) H D(i?i, i?j, tx) H I(A, 77 ,rx) H D(A, it*/, tx)} 

f r p 2 1 _ _ _ _ 

= / — Pr{D( 5 i, 77, rx) H D(E?i, it*/, tx) H I(A, 77, rx) H D(A, R /, tx)| d(i?i, A) = t}dt 

J 0 T n 


(A 4 ) 


Pr{D(E?i, 77, rx) D B(i?i, i?,/, tx) n I (A, 77, rx) H I (A, it*/, tx) | d(B 1, A) = f} 

= Pr{D(i?i,it!/, tx) H I(A, Ri , tx) | D(E>i, 77,rx) n D(A,77,rx) n d(E?i, A) = £} 

• Pr{D( 5 i, 7 */, rx) n I(A, 77, rx) | d(E?i, A) = £} 


(A 5 ) 
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Pr{l(i?i, Ri 7 tx) H l(A, i2/,tx) | D(F?i, rj, rx) n D(A, 77 , rx) flc?(i?i,A) = t} 

S I 2(r,0 1 R p A 1 r I ) 


= exp < —A 


PT 


Su2(t,Ri,Ri) 


1 - 


7rR,p 


rdrd9 




= exp < —A pt 


27 tR} — S T (t , Rp Ri) — J J 


Sn(r, O.RpR.r!) 


Su2(t,Ri ,Ri) 


PRy 


rdrdO 


(A7) 


M = A 5 7rrp 


f rp 21 

^ / 

/ 9 exp 

—Apr 

! 0 r p 
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7rr j — 2rj arccos 



dt. 


ft < As7rr 
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2t 
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exp < —A pt 


< As7rr 
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p 

2t 


7rrj — 2rf 


7r £ 

2 ~ 2 ^ 


+ ^/r| - j 


P / 

0 ' p 


exp(—XpTtrj)dt 


= As'TT 

2A 5 tt 

— \ 2 
a pt 


2/3 


ALr? A 2 DT1 r 


2 

PT' I 


exp(-A ptPvj) - exp(-Ap r ^rj) 


A 


PT 


(^/) 


-2 


d£ 


Pr{l(5i, 77 , rx) H l(F?i, i?p, tx) H D(A, rj, rx) H fl(A, Rp tx) | d(Bi 7 A) = £} 


= exp < — A 


PT 


27r(r| + Rj) - S T (t, rj, r T ) - S T (t, R T , Rj) 


Si2{r,0,Rp,t,r!) 


Sjj 2 (^)P/ )Rl ) 


nRp 


rdrdO 


(A8) 


Let Xpp denote the Poisson point process formed by 
primary transmitters. If we remove from Xpp primary trans¬ 
mitters whose receivers are within distance rj of B\ or A, 
then it follows from the Coloring Theorem [17, Chapter 5] that 
all the remaining primary transmitters form another Poisson 


point process with density A pt 


1 - 


Si2(r,0 ,R P ,tpi ) 


*R% 


where 


S / 2 (c R P , £, rj) is the area of the circle with radius R p and 
centered at (r, 9) intersecting the two circles both with radii 
rj and centers^ apart [see Fig. 16(c)]. We thus have (A7), at 
the top of the page, where Su 2 (t< Rt Ri) is the union of two 
circles with both radii Rp and centers^ apart [see Fig. 16(b)]. 

Substitute (A6), (A7) into (A5), we have (A8), as shown at 
the top of the page. The expression for the conditional average 
degree p thus follows by plugging (A8) into (A4) and then (A3), 
(A4) into (A2). 


Appendix B 
Proof of Coroffary 3 

From [1, Appendix A] and Figs. 16(b) and (c), we know that 
when rj > R p + Rp 


I(Ri,R p ,ri) =R 2 j 


(Bl) 



S I2 {r,9..R p ,t,r!) 


Su2(t,Ri ,Ri) 


irRp 


rdrd9 = Su 2 (t, Ri, Ri) 


— 2pRp — Sj{t , Rj 1 Rp). 

(B2) 


Substitute (Bl) and (B2) into (Al), we have 


fj, = \ s pr 


>rp 21 


P I r 2 

0 ' p 


exp [—^PTpr] — Sj(t, rj, r/))]di. (B3) 
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Plugging the expression for S/(f, rj, rj) [1, Appendix A] into 
(B3) yields the equation shown at the top of the previous page. 

By applying the inequality arccosfx) < xforO < x < 1, 
we have the equation at the top of the previous page, where we 
have assumed that r p = /3ri (0 < f3 < 1) under the disk signal 
propagation and interference model. Since 77 oc (ptx) 1 ^, we 
arrive at Corollary 3. 
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Summary Seizure prediction has proven to be difficult in clinically realistic environments. Is it 
possible that fluctuations in cortical firing could influence the onset of seizures in an ictal zone? 
To test this, we have now used neural network simulations in a computational model of cortex 
having a total of 65,536 neurons with intercellular wiring patterned after histological data. A 
spatially distributed Poisson driven background input representing the activity of neighboring 
cortex affected 1% of the neurons. Gamma distributions were fit to the interbursting phase 
intervals, a non-parametric test for randomness was applied, and a dynamical systems analysis 
was performed to search for period-1 orbits in the intervals. The non-parametric analysis sug¬ 
gests that intervals are being drawn at random from their underlying joint distribution and the 
dynamical systems analysis is consistent with a nondeterministic dynamical interpretation of 
the generation of bursting phases. These results imply that in a region of cortex with abnormal 
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connectivity analogous to a seizure focus, it is possible to initiate seizure activity with fluctu¬ 
ations of input from the surrounding cortical regions. These findings suggest one possibility for 
ictal generation from abnormal focal epileptic networks. This mechanism additionally could help 
explain the difficulty in predicting partial seizures in some patients. 

© 2011 Elsevier B.V. All rights reserved. 


Introduction 

Epileptic seizures are brief, episodic phenomena. Partial 
seizures, the most common seizure type, arise from focal 
brain regions (e.g. temporal, parietal) (Niedermeyer, 2005). 

While in some instances there may be an identifiable 
cause for the seizures (e.g. tumor, cavernoma, hippocam¬ 
pal sclerosis), in other instances no clear pathology is 
determined. The hallmark of an epileptic seizure is the 
involvement of local or regional neural networks; repeti¬ 
tive firing of a single neuron does not produce symptoms 
without this network involvement. What causes the interic- 
tal to ictal transition? A typical partial seizure lasts less than 
2 min plus any postictal state (Afra et al., 2008). Therefore, 
even if a patient has very frequent seizures, the majority 
of time is spent in the interictal state. While some seizures 
can be provoked or are more likely to occur under certain 
situations (e.g. sleep deprivation, photic stimulation), the 
majority of seizures appear to occur spontaneously without 
known association with definable influences. 

There has been considerable interest in seizure predic¬ 
tion in recent years. Obviously if seizures could be reliably 
predicted, then the option for targeted therapy exists (e.g. 
stimulation), or at least the patient could remove them¬ 
selves from potentially dangerous situations. The underlying 
hypothesis for seizure prediction is that there are changes 
in cerebral dynamics that may precede the clinical seizure 
by minutes to hours (reviewed in Sackellares, 2008). These 
changes may be local (i.e. near the seizure focus) or remote. 
These changes are not apparent with visual analysis of the 
EEG, even with intracranial recording arrays. Some groups 
have identified high frequency activity that may signal the 
onset of neocortical partial seizures, but this is an exam¬ 
ple of improved seizure detection, not prediction (Worrell 
etal., 2004, 2008; Bragin etal., 2010). Reliable seizure pred¬ 
ication has been challenging and even the most enthusiastic 
proponents of the prediction hypothesis acknowledge the 
difficulties with current algorithms (Lehnertz et al., 2007; 
Mormann et al., 2007; Andrzejak et al., 2009). 

Seizure prediction may be difficult due to rapid bistable 
state changes at the time of ictal onset in the neocortex 
(Suffczynski et al., 2006; Lopes da Silva et al., 2003). 

The mechanisms underlying a bistable state change 
may be quite different between primary generalized (e.g. 
absence) and partial epileptic seizures. A bistable state 
change may be more applicable to these primary general¬ 
ized seizures which have abrupt bilateral cerebral onset. 
In this paper, a different possible mechanism is presented 
under which seizure prediction would be difficult in some 
patients with focal seizure onset. 

Knowing, as we do, that partial seizures are a reflec¬ 
tion of transient abnormal regional network activity, it 
is reasonable to postulate that these seizures in at least 
some (perhaps many) patients result from abnormal neural 


networks (e.g. the epileptogenic zone) (Jacobs et al., 2000). 
We describe here a model of the epileptogenic zone where 
the epileptic focus is represented by an abnormal neu¬ 
ral network that has very slightly altered connectivity so 
that, while seizures only occur infrequently, they can be 
triggered by normal background activity originating from 
outside the epileptogenic zone. This background activity 
could be influenced by various physiologic factors (e.g. 
sleep), but nevertheless this background activity would not 
result in seizure activity in the non-epileptic brain. This 
does not discount the possibility that some changes in neural 
network synchrony may occur in the f "normal” brain since 
the cumulative lifetime incidence of unprovoked seizures 
approaches 4% (Hauser et al., 1993). Often these seizures 
are provoked (e.g. medications and alcohol) and less than 
half of these patients have recurrent seizures. The life¬ 
time cumulative risk of developing epilepsy only ranges 
from 1.4% to 3.3% (Krumholz et al., 2007; Berg and Shinnar, 
1991). In this model, however, where normal background 
activity, occasionally or rarely produces a seizure in abnor¬ 
mal regional networks, seizure prediction would be difficult 
since detectable preictal changes would not be present; the 
first changes would in fact be seizure initiation. 

Epileptic networks in neocortex or the hippocampus 
show anatomical changes compared to normal tissue (Jacobs 
et al., 2000; Sallin et al., 1995). These changes can progress 
with time (Sallin et al., 1995; Arellano et al., 2004). This 
could result in neuronal networks more amenable to seizure 
generation (electrical or clinical) over large regional areas. 
There is a complex interrelationship, much of it not well 
understood, between neurons which are dysfunctional and 
the neural networks which can promote seizures (Leussis 
and Heinrichs, 2007; Kumar et al., 2007; Swann et al., 
2007). Even in the non-epileptic brain, excitatory connec¬ 
tions predominate with 80-90% of synapses being excitatory 
(Braitenberg and Schuz, 1998). 

With neuronal network simulations it is possible to con¬ 
trol, study, and quickly change the various influences on 
network behavior. Recently, we presented the results of 
computational simulation studies examining the role of 
external field stimulation on ongoing bursting activity in a 
neural network (Anderson et al., 2007, 2009). The corti¬ 
cal model used in these studies consists of discrete single 
compartment Hodgkin-Huxley type cells which are spa¬ 
tially arranged in a realistic fashion having both a layered 
and columnar structure. Since neural network behavior 
reflects the aggregate output of the component neurons, 
single compartment neurons allow greater computational 
efficiency and the ability to model larger networks in studies 
of network behavior. Arrangements of connected simu¬ 
lated neurons in this manner can demonstrate spontaneous 
bursting phases and have spatial characteristics similar to 
seizures recorded from humans (Anderson et al., 2007, 2009; 
Kudela et al., 1997, 2003a,b, 2005; Franaszczuk et al., 
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2003). We now present the results of a similar neuronal 
network model with random surrounding background inputs. 
The goal of this study was to investigate the statistical struc¬ 
ture of the resulting bursting network activity to seek the 
presence or absence of predictable patterns in the behavior. 

Materials and methods 
Computational model format 

The individual neurons in this neocortical model were represented 
by single compartment neurons bearing synaptic connections from 
the rest of the network, and embedded with a fixed set of ionic 
conductances. The membrane potential varies as: 

dV 

Cm = /syn — ^Na ~~ ^Ca ~ ~ ^K(Ca) — — ^L- 

The individual currents include the input synaptic current / syri , 
inward sodium and calcium currents /n 3 and / Ca , outward potas¬ 
sium currents including a delayed rectifier current /k, a calcium 
dependent potassium current / K (c a )> a transient potassium current 
/a, and a leakage current / L (Av-Ron, 1994). The minicolumns used 
in the simulation consist of 16 cells with intrinsic intracolumnar 
wiring, adapted from the neocortical work of Douglas and Martin 
(2004), as described more fully in previous studies (Anderson et al., 
2007, 2009). This is both for its ease of implementation compu¬ 
tationally and for its experimental support in somatosensory and 
visual cortex (Douglas and Martin, 2004). The geometry imposed on 
a computational model becomes relevant when studying any spa¬ 
tially dependent effects on the resultant spreading activity. The 
minicolumns in this simulation have a 25 |xm center-to-center spac¬ 
ing in a square lattice repeating structure. The total number of 
cells examined was 65,536, representing a simulated cortical sur¬ 
face area of 1.6 mm x 1.6 mm. Fig. 1 demonstrates a schematic of 
the intracolumnar excitatory cell connections and the organization 
of the minicolumns in planar space as well as snapshots of the 
resultant activity in the layer ll/lll pyramidal cell component dur¬ 
ing model bursting activity. The model connectivity and synaptic 
currents are described further in the Supplementary material. 

The base connection pattern studied in this report is represen¬ 
tative of one that can produce robust bursting as previously studied 
(Anderson et al., 2007). The numbers of extra-columnar connec¬ 
tions formed by each cell class are presented in Supplementary 
Table 1 . There are seven cell classes modeled: four classes of exci¬ 
tatory cells including layer ll/lll pyramidal cells, layer IV stellate 
cells, layer V pyramidal cells, and layer VI pyramidal cells, and 
three classes of interneurons including basket cells, double bouquet 
cells, and chandelier cells. Most of the model changes described 
in the described studies involve alterations in connection num¬ 
bers between layer ll/lll pyramidal cells, one of the known robust 
horizontal connections systems in the cortex supporting epileptic 
propagation (Telfeian and Connors, 1998). The base connection for 
this system, N 2/ 3 : 2/3 = 178, is defined as the number of layer ll/lll 
pyramidal cells a given layer ll/lll pyramidal cell contacts in its 
axonal distribution. 

The model in general illustrates consistent bursting behavior, 
with epochs of spontaneous bursting onset and cessation given a 
random background input of Poisson based charge injection to 1% 
of the cells in the model. This is an effort to treat the underlying 
cortical activity as input from neighboring cortex, with the model 
itself treated as the epileptic focus given its ability to produce net¬ 
work bursting epochs. The synaptic input used for the background 
was not periodic in nature. Average rates for these Poisson distri¬ 
butions are described in "Activity changes with mean background 
frequency” section and Fig. 2. The synaptic activations used for the 


background inputs were the same used in the cell to cell connec¬ 
tions, and followed the same rise and decay times appropriate for 
postsynaptic potentials. 

The pseudo-random number generator used for the application 
of the noise pulses was a linear congruential generator implemented 
with the C-function drand48, with an intrinsic period of 281 x 10 12 . 
For a 30 s simulation and 10 -5 s time-step, this function was called 
1.966 x 10 9 times for 1% of the cells undergoing background input. 
The period length for the pseudo-random generator is 143,000 times 
larger than this number. 

Statistics and analysis 

The interbursting phase intervals in the model were fit with a 
gamma distribution (Suffczynski et al., 2005, 2006). The functional 
form of this distribution /(...) is given by 

/(At) = (jrr(a))- 1 A t a ~' i exp . 

where At is the interbursting phase interval, a is the shape 
parameter, p is the scale parameter, and r{...) represents the 
gamma function. Parameters were estimated using the MATLAB 
function gamfit which returns maximum likelihood estimates and 
95% confidence intervals for the shape and scale parameters. A non- 
parametric test of randomness was used to attempt to establish 
whether intervals were being drawn at random from their underly¬ 
ing joint distribution. This was based on the circular definition of 
the lag-1 serial correlation coefficient (Wald and Wolfowitz, 1943). 
p-Values were computed under the assumption of asymptotic nor¬ 
mality of the test statistic. 

A method for the detection of unstable periodic orbits (of period- 
1) in successive interbursting phase intervals was applied (So et al., 
1996, 1997). A period-1 orbit is a fixed point of the nonlinear map 
expressing the evolution of the state of a system, iterated a single 
time (Guckenheimer and Holmes, 1983). Intervals were embedded 
in a two dimensional state space and 10 4 sets of transformed inter¬ 
vals were obtained after randomization. Dimensional reduction was 
instituted using circles of radius 9.4 ms centered along the diago¬ 
nal of the state space (Le Van Quyen et al., 1997). One hundred 
surrogates were produced to test the significance of peaks which 
appeared along this diagonal. The surrogates were generated using 
the amplitude adjusted Fourier transform algorithm (Theiler et al., 
1992). This shuffles the original sequence of interbursting phase 
intervals, maintaining the original amplitude distribution of the 
data while approximately matching its Fourier power spectrum. 

Results 

A total of 1600 s of discontinuous 20- and 30-s data seg¬ 
ments were obtained, holding the base connectivity of the 
layer ll/lll pyramidal cells to layer ll/lll pyramidal cells at 
N 2 / 3 : 2/3 = 178. Only the random number seed supplied to the 
background input generator was varied for each of these 
runs. Additionally, five continuous segments of data were 
obtained with the base connectivity set at N 2 / 3 : 2/3 = 172 of 
lengths 320 s, 250 s and N 2/ 3 : 2/3 = 178 of lengths 195 s, 140 s, 
and 208 s. These data were used for the dynamical sys¬ 
tems analysis presented below. In addition to these data, 
sixteen 20 -s runs were obtained with the model while vary¬ 
ing the mean background input frequency at the base level 
of connectivity. Five 20-s runs were obtained at the base 
connectivity while varying the temporal pattern of the back¬ 
ground input, and ten 20 -s runs were obtained with a fixed 
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Figure 1 (A) Representative connectivity of the excitatory cellular component in a given modeled minicolumn, wiring after 

(Douglas and Martin, 2004). (B) Three dimensional arrangement of the 16x16 array of minicolumns in space. (C) Representative 
snapshots of evolving activity over 0.02 s in the layer ll/lll pyramidal cell component. Each pixel represents one cell, color coded 
proportionally to the number of action potentials fired in bins of 1 / 100 of a second. 
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Figure 2 Network activity produced by sequential increases in the mean frequency of the applied background activity (background 
synaptic input provided to a fixed 1% set of the modeled cells, summed layer ll/lll pyramidal cell action potentials in 10 ms bins). 
The model exhibits a transition from episodic bursting to a very regular bursting behavior driven by the background input. 


sequence of background input while varying the overall layer 
ll/lll to layer ll/lll connectivity (N 2 / 3:2/3). 


Activity changes with mean background frequency 

These experiments were performed with the base connec¬ 
tivity of the layer ll/lll pyramidal cell system set at the 
base value of N 2 / 3 : 2/3 = 178. If the mean frequency of the 
applied background synaptic input is varied from 0.25/s 
up to 10/s, several patterns of activity become apparent 
(Fig. 2). At 0.25/s, only the low level set of activity pro¬ 
duced with action potential production by the background 
input is observed, at the cells where the input takes place. 
This plot essentially demonstrates the Poisson-based random 
network activity between the bursting phases. At 0.5/s, spo¬ 
radic bursting activity transmitted to the network as a whole 
can be observed, with long quiescent epochs. As the applied 
mean background frequency is further increased, longer and 
longer periods of constant bursting activity can be observed 
up to 5/s. After this, a second activity transition is observed, 
in which the activity changes from continuous bursting into 
short periods of very large amplitude bursting (in terms of 
numbers of active neurons) punctuated by brief periods of 
quiescence. At 10/s it appears to dominate the activity. This 
implies a saturation mechanism in this class of connected 
network, which comes into play after a critical percentage 
of cells are excited per time step. This saturation behav¬ 
ior is again an intrinsic property of the fixed network being 
probed. Additionally, these studies imply that seizure onset 
can be driven by neighboring cortical activity, albeit regu¬ 
lar patterns that might not be typical of random background 
input activity utilized here. 


Network activity altered with input pattern 
changes 

Within the context of this model, it is possible to change the 
random pattern of connectivity between represented cells, 
and still keep the total number of connections between the 
various cell classes constant. By varying the random number 
seed supplied to the generator distributing the connections, 
different patterns of activity can be demonstrated, even 
with the same application of underlying cortical activity 
applied to the same cells. Examples of the changes in activ¬ 
ity are presented in Fig. 3A. The pattern produced ranges 
from almost constant bursting throughout the 20 s exam¬ 
ined, to brief periods of on and off bursting. Similarly, the 
connectional pattern can be held constant along with the 
cells in which the background activity is applied, while vary¬ 
ing the random number seed responsible for producing the 
order in which background pulses are injected into the cells. 
This produces similar alterations in network activity demon¬ 
strated in Fig. 3B, and can include several time scales of 
bursting epochs. These studies imply a rich dynamics of 
stochastic behavior in randomly connected neural networks 
receiving temporally uncorrelated background input, and 
again point toward difficulties in predicting when the burst¬ 
ing phases might begin. 

Network activity is very sensitive to numbers of 
excitatory connections 

Finally, changing the numbers of connections in this net¬ 
work model can produce substantial alterations in network 
behavior. Fig. 4 presents a sequence of plots of the layer 




























































































































































































































79 


a BDDD 



4DDG 


0 








6*00 




BDDD 



□ 10 30 

Time (sec) 


am 



Time (sec) 


Figure 3 (A) Network activity induced by varying the random connectivity pattern between cell classes (different connectivity 

seeds, , for the random number generator). Numbers of action potentials in layer ll/lll pyramidal cell component, 10 ms time 
bins. (B) Network activity induced by varying the random time sequence of background synaptic input, In these experiments, 
all cellular connections remain fixed, and the identity of the cells undergoing background synaptic input remain fixed. 


ll/lll pyramidal cell activity (time-binned action potential 
numbers) for various degrees of extracolumnar connectivity 
between the layer ll/lll pyramidal cells. The base activ¬ 
ity explored in this manuscript is shown in the plot with 
total number of connections held at N 2 / 3 : 2/3 = 178 (between 
extra-minicolumnar layer ll/lll pyramidal cells). A rapid 
reduction in network bursting is shown for a connection num¬ 
ber reduced below this, and almost continuous activity is 
shown for connection numbers above this. Fig. 3A and B data 
were obtained with the connectivity set at N 2 / 3 : 2/3 = 178. 
Only the random pattern of connectivity is varied in Fig. 3A 
and the time sequence of background input in Fig. 3B. These 
studies were performed in the context of a constant aver¬ 
age level of surrounding background input, and imply the 
importance of internal connectivity in the development of 
uncontrolled bursting of the network. 


Statistical analysis of interburst phase intervals 

The statistical properties of the interburst phase intervals 
for five continuous runs of the model at connectivities of 
^ 2 / 3 : 2/3 = 178 and 172 were analyzed. This was motivated 
from a dynamical systems perspective, where periodicity in 
sequential intervals was sought for. Fig. 5A displays a his¬ 
togram approximation to the probability density function 
for interbursting phase intervals for Continuous Run 1, which 
consisted of 163 intervals collected from a 320 s run of the 
simulation. Fig. 5D displays the same histogram approxi¬ 
mation for Continuous Run 4, with a total of 58 intervals 
collected from a 140 s run of the simulation. Gamma distri¬ 
butions were used to fit these densities (Fig. 5A and D (blue 
traces)) (see "Materials and methods" section; Suffczynski 
et al., 2005, 2006). In the case of Continuous Run 1 
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Figure 4 Changing the absolute connectivity in the layer ll/lll pyramidal cell component (number of layer ll/lll pyramidal cells 
contacted by a given layer ll/lll pyramidal cell, N 2 / 3 : 2 / 3 ) in the model produces alterations in the network bursting behavior. At 
very low absolute connectivity (N 2 / 3 : 2/3 = 110) network bursting is brief and isolated, while at higher levels of absolute connectivity 
periods of constant bursting can be observed. 


(Fig. 5A) we found a = 6.09 (4.61-8.04) for the shape param¬ 
eter of the distribution, and /3 = 0.26 (0.20-0.35) for the 
scale parameter (95% confidence intervals in parentheses). 
For Continuous Run 4 (Fig. 5D), the corresponding values 
werea = 7.14 (5.00-10.19), and /3 = 0.23 (0.16-0.33) (simi¬ 
lar results were obtained for 3 further continuous runs and 
one discontinuous run - results not shown). In accord with 
the interpretation of Suffczynski et al. (2005, 2006), the fact 
that the shape parameter a was larger than one in all runs, 
suggests the potential presence of periodicity in the genera¬ 
tion of bursting epochs. To probe this link further, additional 
statistical tests were performed as described below. 

To ascertain how intervals were being drawn from their 
underlying joint distribution, we applied a non-parametric 
test of randomness to the interbursting phase intervals (Wald 
and Wolfowitz, 1943). In the case of Continuous Run 1, we 
found that one cannot reject the null hypothesis of ran¬ 
domness at the 5% significance level (p-value 0.93). For 
Continuous Run 4, we found the same conclusion at the 5% 
significance level (p-value 0.29). This conclusion was also 
borne out for the remaining three continuous runs. 

A method for the detection of unstable periodic orbits 
(of period- 1 ) was then applied to test for the presence 
of deterministic dynamics in the generation of interburst¬ 
ing phase intervals (So et al., 1996, 1997). This method 
institutes a transformation of the sequence of intervals such 


that the transformed sequence is clustered around locations 
of potential periodic orbits. One can compare the peaks of 
these clusters to those generated by surrogate data (Theiler 
et al., 1992), to compute the statistical significance of the 
peaks, and thereby ascertain the potential existence of peri¬ 
odic orbits in the data. Fig. 5C and F shows the peaks of the 
clusters were not significantly greater than those generated 
by surrogate data (see caption), and so no period -1 orbits 
were detected for either run (nor for the remaining three 
continuous runs), at the limit of detection in the current 
data set. 


Discussion 

Our results demonstrate that while holding the mean prop¬ 
erties of the network stable (mean connectivity numbers, 
mean background excitation rates), very rich and strikingly 
different dynamics are produced by changing the model 
details. Epileptogenic behavior can be created in these net¬ 
works, as described above, by changes in the random pattern 
of connectivity, while holding fixed the intrinsic active or 
passive membrane properties in the constituent neurons. 
Such changes in connectivity could be analogous to changes 
in underlying connectivity that might occur following 
cerebral insults, or repetitive seizures. Similar modeling 



























































81 



Interval between bursting epochs {sec} 



w 




Interval between bursting epochs (sec) 



Figure 5 Statistical analysis of the interbursting phase intervals. (A and D) Interval histograms are fit with gamma distributions 
revealing shape parameters a> 1 in the case of Continuous Run 1 (A) and Continuous Run 4 (D). (B and E) Two dimensional delay 
embedding of sequential intervals (Spearman correlation coefficients are not significantly different from zero) for Continuous Runs 
1 and 4, respectively. (C and F) Testing the significance of potential period-1 orbits detected through a dynamical analysis of the 
sequence of points displayed in (B) and (E), respectively. The y-axis represents the fraction of surrogate (shuffled, see f 'Materials 
and methods” section) sequences with a maximal deviation from the mean surrogate result of greater than W (So et al., 1996, 
1997) (10 4 random matrices and 10 2 surrogates were used). The horizontal dotted (red) line displays the maximal deviation for the 
simulation data. Since there exists a significant fraction of surrogates with deviation greater than that for the simulation data (for 
both C - 20% and F - 10%), neither plot displays convincing evidence of the existence of a period-1 orbit. (For interpretation of 
the references to color in this figure legend, the reader is referred to the web version of this article.) 
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work has been performed on changes in connectivity with 
resultant epileptic effects in hippocampus (Morgan and 
Soltesz, 2008; Dyhrfjeld-Johnsen et al., 2007). In our results, 
local increases or decreases in connectivity in the model may 
alter the network in a similar manner (a tempting compar¬ 
ison which must be tempered by our lack of understanding 
of the functional significance of new or absent connections, 
see Sallin et al., 1995; Dinocourt et al., 2003; Marco et al., 
1997; Dudek and Sutula, 2007; Magloscky, 2010). In the 
model above, the external background activity can be held 
constant or changed in a variety of time-frequency manip¬ 
ulations. Various simulations can be created where seizures 
occur rarely or very frequently. 

Similarly, human epileptic seizures are episodic, tran¬ 
sient events. Whether epileptic seizures are random events 
is not clear, but times of ictal onset can behave as a random 
process (Suffczynski et al., 2006). In some patients there 
are no identifiable contributing factors, in other patients 
such conditions such as sleep deprivation may increase the 
chance of seizure occurrence and in still other patients 
seizures can be provoked by specific stimuli (e.g. hyperven¬ 
tilation, intermittent photic stimulation; Lu et al., 2008; 
Vinogradova et al., 2009; Kaplan et al., 2009). In these 
examples there may be resulting alterations of background 
activity within and outside the epileptogenic zone (not evi¬ 
dent from the EEG) that make it more likely that a seizure 
may arise from the existing focal epileptogenic network. 
While these influences might create a state where seizures 
are more likely to occur, this facilitatory state should be dis¬ 
tinguished from the presence (or absence) of a preictal state 
in unprovoked seizures. 

Random behavior of network 

One interesting aspect of this model is its ability to demon¬ 
strate both spontaneous periods of bursting activity as 
well as self-termination of the bursting. As illustrated in 
Figs. 2-5, these periods of bursting can be quite variable in 
their length. Throughout the bursting and quiescent phases, 
the distributed background activity is constantly active, 
affecting approximately 1% of the total cells. We believe 
this model may represent the type of network behavior 
described by Lopes da Silva et al. (2003), in which epileptic 
activity within the network cannot be predicted from the 
interictal state. The non-parametric and nonlinear dynami¬ 
cal analyses described in the "Results” section support this, 
however inference from this is somewhat limited given the 
finite size of the data set, the small region of modeled area, 
and the gamma distribution fits described earlier. 

This obviously may not be true of all types of clini¬ 
cal and model epileptiform behavior. For example Osorio 
et al. (2009, 2010) show that pharmacoresistant seizures 
tend to cluster, and may have an inherent self-triggering 
capacity. This might make a prediction algorithm possible 
to implement in a useful fashion. Others (Suffczynski et al., 
2005, 2006), however demonstrate that seizure onset can 
be described in both experimental and model data as a ran¬ 
dom walk process, with possibly a deterministic mechanism 
ascribed to seizure termination. The random onset nature 
would (in a bistable network with Poisson transitions) be 
difficult to predict. 


In the case of our model the underlying fluctuations lead¬ 
ing to seizure onset are the random background activity we 
have imposed on the network. The properties of the network 
connectivity then support the bursting frequency observed 
(Anderson et al., 2007). These types of grossly synchronized 
bursting states in the context of a neural network have 
been studied extensively by Kowalski et al. (1992). They are 
truly pathological in the sense that they would block or con¬ 
found any information flow through this network. It is also a 
network-generated state, and can be stopped by eliminating 
synaptic transmission (Kowalski et al., 1992; Keefer et al., 
2001; Rhoades and Gross, 1994). 

Many limiting cases of the gamma distribution have phys¬ 
ical interpretations that might make it easier to understand 
spiking data from cortex when used for fitting (Papoulis, 
1984; Suffczynski et al., 2005). When the shape parameter, 
a, in the gamma distribution is an integer, the distribution 
is known as an Erlang distribution and represents the prob¬ 
ability distribution of the waiting time until the ath event 
from a sampled Poisson process with characteristic time p. 
This might be comparable to the distribution of the number 
of spiking or underlying synaptic events required to trigger 
the network bursting behavior. One could envision trying to 
extract the integer value of a from either computational 
or experimental data. Similarly, the Maxwell-Boltzmann 
distribution can be related to the gamma distribution 
under certain restrictions on the gamma scale and shape 
parameters, implying the possibility of extracting almost 
thermal-like or statistical mechanical interpretations of the 
network activity (Hegyi, 1996). 

Limitations of the model 

Our plots demonstrate gross summed numbers of time- 
binned action potentials in the model for given neuron 
classes. The interictal activity is a random Poisson input to 
1 % of the cells in the model and is demonstrated in the plots, 
particularly in Fig. 2, Panel 1, inset. We chose to impose 
this random interictal behavior on the model to demon¬ 
strate that these fluctuations can produce very coherent 
synchronous oscillations in an unpredictable fashion. How¬ 
ever, the surrounding input might not have to be completely 
random to bring about the same effect. Epilepsies involving 
specific stimuli might require a coherent surrounding input 
to give rise to the seizure (Lu et al., 2008; Vinogradova et al., 
2009; Kaplan et al., 2009). A more realistic technique would 
be to treat the interictal background as a log-normal process 
which does have some support in the literature (see Farkhooi 
et al., 2009; Waters and Helmchen, 2006). Newer recording 
methods from invasively monitored epilepsy patients might 
help determine what patterns of background activity are 
causative (Truccolo et al., 2011; van Gompel et al., 2008). 
Our intent was to represent the resting interictal cellular 
activity as fundamentally sparse with a random component. 
This was most easily implemented as a low frequency Poisson 
process. 

It is possible to view the single synaptic input (driven 
by the background source) as representing several weaker 
but synchronized inputs. This represents a limitation to this 
modeling approach, a limitation that in large degree could 
be corrected with more elaborate multicompartment and 


83 


synaptic representations of the cells to make the multiple 
weaker inputs more independent. Additionally in this lim¬ 
ited data set, we are unable to say much about time epochs 
larger than several hundreds of seconds (our largest con¬ 
tinuous simulation being 320s). This is clearly a limitation 
in this technique and future computational work, including 
efforts in our laboratory will explore longer time intervals 
of ictal and interictal behavior, and possibly push the detec¬ 
tion limit for predictable activity lower (or detect it more 
accurately). Models such as this particular rigid crystalline 
arrangement of neurons with many fixed cellular properties 
in some sense have less inherent "randomness” than real 
neocortex. This work was primarily meant to spark inter¬ 
est in a possible mechanism for the difficulties inherent in 
seizure prediction, but by no means should it be interpreted 
too literally. 

Implications for seizure prediction 

The purpose of the model presented here is not to judge the 
effectiveness of seizure prediction, but rather to present a 
plausible, alternative hypothesis for partial seizure occur¬ 
rence that could explain situations where seizure prediction 
may not be possible. Indeed, it is conceptually attractive to 
consider that, just as partial seizures may result from vari¬ 
ous pathologies and mechanisms, that some partial seizures 
may not be reliably predicted. It is beyond the scope of 
this discussion to address the various methods being used 
in attempts to predict epileptic seizures. It is always impor¬ 
tant to differentiate true seizure prediction from improved 
seizure detection. Other commentaries and reviews address 
these methods and include discussions of the challenges 
and frustrations to date in routine seizure prediction even 
with intracranial electrodes (Estellar et al., 2001; Litt and 
Echauz, 2002; Sackellares et al., 2006; Haas et al., 2007; 
Osorio et al., 2001). 

This study focused on the interval to the time of the 
next "seizure” or busting phase in the model. Our interest 
was in the occurrences of the transitions from the quiescent 
or background state into the pathologic state, since that 
is what most seizure prediction algorithms are optimized 
for. There is fairly strong evidence, certainly in the case of 
complex partial seizures in temporal lobe epilepsy, that the 
length of clinical seizures can be fairly uniform in a given 
patient (see for instance Afra et al., 2008). The time inter¬ 
val durations of the seizures themselves may also prove to 
have to predictive guidance as well and should be explored 
in the future in modeling efforts. This may be more useful 
in the case of neocortical epilepsy with its rapid spread and 
possible involvement of larger regions of tissue. 

Additionally, this model can incorporate incremental 
changes in connectivity in the epileptogenic zone, changes 
that could be a model for progressive epileptogenesis (e.g. 
sprouting). This type of model also provides data that is 
comparable to clinical data from epilepsy patients. The 
simulated network activity is taken from a small region of 
modeled cortex comparable in size to the surface area under 
a typical subdural grid electrode, and makes comparisons 
between modeling efforts and clinical data easy to perform 
(Anderson et al., 2007, 2009; Kudela et al., 1997, 2003a,b; 
Franaszczuk et al., 2005). Indeed the major advantage of 


neuronal network modeling is the ability to simultaneously 
monitor activity in all of the network neurons under given 
experimental conditions, something not possible with bio¬ 
logical systems, even with sophisticated recording arrays. 
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Evolving communicative complexity: 
insights from rodents and beyond 
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Laboratory, Aberdeen Proving Ground, MD 21005, USA 

Social living goes hand in hand with communication, but the details of this relationship are rarely 
simple. Complex communication may be described by attributes as diverse as a species’ entire 
repertoire, signallers’ individualistic signatures, or complex acoustic phenomena within single 
calls. Similarly, attributes of social complexity are diverse and may include group size, social role 
diversity, or networks of interactions and relationships. How these different attributes of social 
and communicative complexity co-evolve is an active question in behavioural ecology. Sciurid 
rodents (ground squirrels, prairie dogs and marmots) provide an excellent model system for study¬ 
ing these questions. Sciurid studies have found that demographic role complexity predicts alarm call 
repertoire size, while social group size predicts alarm call individuality. Along with other taxa, sciur- 
ids reveal an important insight: different attributes of sociality are linked to different attributes of 
communication. By breaking social and communicative complexity down to different attributes, 
focused studies can better untangle the underlying evolutionary relationships and move us closer 
to a comprehensive theory of how sociality and communication evolve. 

Keywords: sociality; individuality; repertoire size; alarm call; information theory; vocal complexity 


1. WHAT IS COMPLEXITY? 

While most of us have an intuitive idea of what constitu¬ 
tes complexity, complexity is difficult to define [1]. 
A system is typically considered more complex if it con¬ 
tains more parts, more variability or types of parts, more 
connections or types of connections between parts or 
more layers of embedded meaning. The information 
[2] required to describe a system can be thought of as 
a measure of the system’s complexity (‘Kolmogorov 
complexity’, after [3]). To illustrate, Dawkins [4, 
p. 265] provides a simple thought experiment: when 
comparing two items or systems, imagine writing a 
book to describe each one. The longer book will 
describe the more complex system. Information-based 
definitions of complexity are most useful because they 
allow complexity to be quantified and compared with 
a single metric [5]. 

Social complexity can be defined in numerous ways 
(e.g. the number of individuals [6-8], the number of 
demographic or social roles [9,10], the strength 
of social bonds [11-13], the complexity of group sub¬ 
structure and relationships [14-18] and combinations 
of these [19,20]). Each of these definitions can be con¬ 
sidered an attribute of sociality. What these definitions 
have in common is they quantify social complexity via 
the number or variability of the social system’s parts, 
connections or layers of categorization (table 1). 

* Author for correspondence (kpollard@ucla.edu). 

One contribution of 13 to a Theme Issue ‘The social network and 
communicative complexity in animals’. 


One integral attribute of social complexity is per¬ 
haps the most straightforward: social group size. The 
number of individual animals present in a social group 
influences that group’s resource needs [21], disease 
ecology [22] and predation risk [23]. Furthermore, 
social group size defines boundaries for other attributes 
of social complexity, such as mating systems or social 
networks [24] . Group size also directly affects the com¬ 
municative landscape, since more individuals may be 
communicated with, followed, discriminated or ignored 
[5,25-27]. Social group size is often straightforward to 
quantify. If a social group’s boundaries are well defined, 
then a simple head count is all that is needed. For all 
these reasons, it is not surprising that social group size 
is one of the most commonly studied attributes of 
social complexity and has been used in a variety of 
taxa [7,28-38]. 

Another critical attribute of social complexity is the 
number or variability of social roles in a social group. 
A role can be thought of as a socially expected behav¬ 
ioural pattern. For instance, social roles may be based 
on whether an individual is a dominant or subordinate, 
a producer or scrounger, or a breeder or non-breeder. 
Blumstein St Armitage [39] quantified social roles by 
focusing on demographic (age/sex) roles in social 
groups, which can be quantified using information 
theory. The key assumption was that groups with 
more overlapping generations, and those with more 
age-sex classes present, were more socially complex 
in their demographic roles. 

Complexity in communication can be defined analo¬ 
gously to complexity in sociality, i.e. via the number or 




90 


<D 

> 

■ t-H 

4-J 

C/D 

05 

hg 

X 

<L> 

$-1 

O 

g 

> 

• ^H 

C/3 

jo 

T) 

x 

<D 

13 

G 

4- > 

a 

5- 1 

<D 

hG 

■M 

• t-H 

<L> 

a 

<u 

aS 

<L> 

U 

<D 

hG 

73 

<u 

■M 

C/D 

<u 

bJD 

bJD 

C/D 

C/D 

<D 

• ^H 

5-h 

O 

bJD 

<D 

■M 

aS 

U 

<L> 

hG 

h 

■M 

• ^H 

X 

JD 

Eh 

S 

o 

o 


>4 

(-1 

aS 

> 

a 

03 

U 

•M 

aS 

hG 

4-> 

C/D 

a 

<D 

■M 

C/D 

>> 

C/D 

G 

o 

• t-H 

■M 

aS 

U 

• t—H 

a 

G 

a 

a 

o 

u 

73 

a 

OS 

C/D 

a 

<D 

4-j 

C/D 

>> 

C/D 

13 

• t-H 

U 

O 73 

C/D OJ 

<+H 


Oh 

S 


C/D 

<u 


G 2 

-o c3 

• ^H 

5—1 r/3 

4-J rD 
4-» Oh 

OS *p 

'S 

G 

o 


G 

o 


aS 


O «L> 

<U 5-i 

1 S 

<1 « 
. o 


o 

G 




,0 73 
G 

H c3 


>> 


x> 

as 

• ^H 

5-h 

aS 

> 

73 

G 

aS 

C/D 

C/D 

<D 

G 

G 

D 

Oh 

O 


5-h 

o 

G 

o 

• t-H 

■M 

aS 
N 
" p 
O 
bf) 

<u 

■M 

aS 

O 

r\ 

<D 

$-i 

G 

■M 

O 

G 

H 

4— 1 

C/D 

<- 1-1 bJD 

o a 

• ^H 

g G 
6 * G 


C/D 

4-J 

$-1 

aS 

Oh 


G 

a> 

<u 

£ 

•M 

o 

t 0 

C/D 

G 

o 

• ^H 

o 

<u 

G 

G 

o 

o 


<D 

4 — 1 

C/D 

>> 

C/D 

<+H 

o 

CO C/3 
(L) 4-J 

Oh 3 
^ n 

4-J Oh 


C/D 

4-J 

$-1 

aS 

Oh 


<L> 

4- J 

c/D 

>> 

C/D 

<+H 

o 

5- 1 
<L> 

a 

G 

G 


<+H 

o 

4-J 
• t-H 

r-J a) 

'o <L> 

4-J 4-J 

aS 3 

■g ,£> 

g *C 

4-J 

>> 2 
•G u 
73 O 
'G co 
G 
qG 


Oh 

• ^H 

4h 

C/D O 

5-i *G 

a> i-G 


C/D 
C/D flj 


O 

5-1 


3 a p 
E 

a a g 

§<13 3 


o o p, 

& o £> 

bJD co 5-1 


<L) 

5-i 

G 

•M 

o 

G 

H C/D 

»o G 

Oh 

O S 

bJD 


73 

<D ^4 
UG 5-i 
• O o 

G ^ 

03 

G 
O' 


<D 

U 

B G 
O O 

r* c/d 

e a 

5-1 I 

Oh C 

• t*h g .2 

■j—* C/D C/D 

CO 

OS Oh CP 

G 

o 

5-1 
bJD 


,0 -G 

H — 1 

a sfi 

as 


C/D 

G 

o 


o 

o 


C/D 

G co 
O « 

-0 *-* 
aS —i 
bJD ctf 

a e 

W) g 
W> Oh 

rrl C/3 

On'? 
G A 

O 25 

5-h O 
W) Oh 


<D 

G 


C/D 

73 

g (U 13 

x> ^ 2 
E ^ 9 
13 g 5/5 

o S 

o 

C/D 


73 

G 

aS 

C/D 

U 

• I—( 

■M 

C/D 


5-1 

o 


C/D 


C/D 

<u 

5-i 

G 


C/D 

<D 


G 

y o 

a w) 

C/D 03 


2 G3 
G U 

tl 2 

CO g 
2 •- 

5-1 Cd 

c3 'o 

a s 


5-1 

G 

O 

rG 

bJD 

<D 

G 


03 


C/D 
5-1 

O 


<D 

G 


5-i 

5-1 

<D 


O 


> 


aS 


^3 
Oh '3 
aS 9 

5-1 G 

O o 


s 

<D 


„ U 

co Q 

^ S 73 
O 

5-i 


<L> 

N 

C/3 

Oh 

G 

o 

N & 

CO co 

Oh 'O 
G O 
o co 
5-i 

bJD 


5-1 

Oh 

<D 

5-1 


bJD 

G 

bJD 

aS 

5-1 

<2 

<D 

N 

• i —i 

C/D 


G 

o 

’M 

S) -p 

2 G 
bC 
bJD 
as 


G 

O 

E 

o 


a 

<L> 

N 

o 

C/3 

u 

Oh 

5-h 

G 

o 

o 

5-h 

W> 


H-J 

X 

3 ) 

Eh 

S 

o 

o 

13 

• t-H 

o 

o 

C/D 


Chh 

O 

G 

o 

« i-H 

H— 1 

aS 

O 

qG 

73 

O 

s 

73 

G 

aS 

C/D 

W> 72 

G 173 


G 

5-1 

aS 

<U 


G 

bJD 

» t-H 

C/D 


C/D 

<D 

5-i 


5-1 
<L> 

Oh 
<D 
5-i 

13 
G 
bJD 

• 

C/3 

q-i 
O 

co G 
co >> 
a> c/d 

S ^3 

S S 

o, 

o 


X 

as 


G 

£>.2 
bG -m 

d u 

•§ § 

a S 

a n 

73 
G . 

.2 5=1 
G a> 
G 5-1 

I ^ 

<2 a 

<D 

5-i 


5-i 

o 

G 

o 

’M 

73 

G 

O 


O 

o 


aS 

G 

C/D 

CN 



X 

E 

• ^H 

b/3 

5-h 



dj 


• t-H 

C/3 

<U 


O 

t , 

C/D 

G 

<D 

<h-h 

13 


y-H 

G 

<L) 

73 

O 

G 


4— • 

bD 

• t-H 

G 

bJD 

« t-H 

<D 

03 

p 

os 


o 

• 

C/D 

4-* 

05 

t-H 

bJD 

G 

G 

4 — 1 

5-h 

4 — 1 

• ^H 

O 5-h 

73 

aS 

o 

C/D 

C/3 

.y <d 

• ^H 

U 

• ^H 


<D 

73 

4 — 1 ^ 

G rrt 

> 

• t-H 


G 

s 


o 

o 


ri 3 3 C 
o o 73 % 
W) o 

G q-i ,0 g 
G os p .G 

<u 


a.a 

C/3 


G 

<u 

o 

tO 

C/D 

G 

o 


C/D 


05 


O OS 

X O p 

G ' 
G 
O 
o 


G 

C/D 


bJD 

t-H 

C/D 


aS 

<L) p 

.a a 


C/D 

<L> 

5-h 


C/D 


C/D 

<L) 


C/D 
C/D 

O 

4— 1 5-h 

5- h O Oh 
<U 03 >j 

a w ^ 

<u 

5-i 


03 

G 

bJD 


C/D 

05 

G 

• 

tG 


<u 

N 

C/3 

<u 

5-h 

2 9? 

5-h Oh 
O >> 

a ^ 

<D 

5-h 


<U 

5-h 


G 

05 

G 

• 

tG 


5-i 

o 


X 

<D 


5-1 £ 

a> 

Oh g 

a> .G 


o 

5-h 

• t-H 

•M 

G 

o 


5-1 

<D 

Oh 

<U 

5-h 


G 

X o 

<L> O 

CO 

•H cd 
co p 
co pi 

iH -2 
^ c3 

<L) G 

W).t! 

C/D 


> 




aS 

O X 


G 

G 

s 

a 

o 

u 


JJ 

E 


o 

u 





irmT 


no 


LLLEI 


III III 


golden 

D M. caudata 



:o 


black-capped 
M. camtschatica 



m 


alpine 

M. marmot a 




Vi onrv 



f< L L 

nudi y 

M. caligata 


P B i 

r h 


Olympic 
M. olympus 


yellow-bellied I 
10 M. flaviventris | * | 


woodchuck 


4- 


D M. monax r ' 

kHz 2- 



r i 


■ r 


Is 


no. alarm calls 

d equivocal □ 1 O 2 


□ 4 




V-“- - -■ 


Figure 1. Alarm call repertoires of seven species of marmots ( Marmota spp.). Sciurid rodents differ in the acoustic structure of 
their alarm calls and in the size of their alarm call repertoires. 


variability of the signal’s or system’s parts, connections, 
or layers of meaning (table 1). For acoustic communi¬ 
cation, attributes of communicative complexity can 
include a species’ repertoire size (number of call types 
[32,39]) or the syntactical complexity or uncertainty 
[40,41] of sequences of calls. Signals may also be complex 
in their association with external referents, such as func¬ 
tionally referential alarm calls that communicate 
predator type or response strategy, in addition to alarm 
[42-45]. Communication of urgency or affective state is 
another form of complexity, providing additional infor¬ 
mation about the situation [46-48]. For example, many 
species of marmots alter the rate of their alarm calls 
with the urgency of the situation or the degree of risk 
[49]. The presence of embedded signature information 
(such as age, sex or individual caller identity [50-54]) 
also adds complexity to communicative signals. 

As social complexity, however quantified, increases, 
more complex communication might be needed to 
manage this social complexity [55-59]. Attributes of 
social complexity may thus generate a need for animals 
to exhibit different types of communicative complexity. 

2. SCIURIDS AS A MODEL SYSTEM 

Sciurid rodents (Rodentia: Sciuridae), particularly 
ground-dwelling social species (tribe Marmotini, 
including ground squirrels Spermophilus spp. and re¬ 
lated genera, prairie dogs Cynomys spp., and marmots 
Marmota spp.), present an excellent model comparative 
system for studying the relationship between social and 
communicative complexity, for several reasons. 

First, ground-dwelling sciurids constitute a speciose 
clade that exhibits a wide range of social structures. 
On one end of the spectrum, some species are nearly 
solitary or live in simple family groups (e.g. 


woodchucks, Marmota monax , live in groups of a 
mother plus her young of the year). Within the same 
clade, more complicated group structures are seen, 
such as those in Eurasian marmots (e.g. M. caudata , 
M. marmota [60]) in which young delay dispersal for 
one, two or more years, adult males participate in 
group life, and females may aggregate in harems or 
matrilines. Social group size also varies considerably 
across ground-dwelling sciurids, as does group cohe¬ 
sion. In addition, sciurids also vary in their tiers or 
levels of social complexity [61,62], with some species 
exhibiting complex relationships between segregated 
social groups (e.g. some prairie dogs). Importantly, 
variation in social group size is not strictly tied to vari¬ 
ation in other attributes of social structure [9], 
allowing these different attributes to be teased apart 
in comparative study. 

Secondly, ground-dwelling sciurids offer a com¬ 
munication system amenable to comparative study. 
Ground-dwelling sciurids produce vocal alarm calls 
in response to predatory stimuli [63]. These calls are 
typically loud and perceptually salient, and they can 
be elicited and recorded by researchers. Within the 
sciurid alarm call system, an array of communicative 
complexity is exhibited. Species vary in the acoustic 
structure of their calls [39,54,64], the size of their 
alarm call repertoires [39,49] (figure 1), the response 
urgency or distance to predator encoded in their calls 
[65-67], syntactic or ordering differences in calls 
[64], and the degree to which age and sex and individ¬ 
ual identity are encoded [54,68]. As with social 
complexity, we can view these as attributes of commu¬ 
nicative complexity and they can be independently 
studied in comparative analyses. 

Thirdly, publishedphylogenies [69-71] aidphylogen- 
etically controlled evolutionary analyses in this taxon. 
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To date, two phylogenetically controlled studies have 
examined the correlated evolution of social complex¬ 
ity and communicative complexity in ground-dwelling 
sciurid rodents [39,54] . Importantly, comparative studies 
in sciurids permit us to decouple drivers of complexity, 
and, as discussed below, have revealed that social com¬ 
plexity, broadly defined, does not simply select for more 
complex communication. Rather, different attributes 
of sociality seemingly select for specific attributes of 
communicative complexity. 


3. DEMOGRAPHIC COMPLEXITY DRIVES 
REPERTOIRE SIZE EVOLUTION 

Blumstein & Armitage [39] used 22 species of ground¬ 
dwelling sciurids to test for a relationship between 
one attribute of social complexity (demographic role 
complexity) and one attribute of communicative com¬ 
plexity (alarm call repertoire size). They defined a 
social complexity index via information theory, using 
data on dispersal patterns, the age/sex composition of 
groups and variability in these traits [39] . The resulting 
numbers, expressed in bits, quantify the amount of 
information needed to describe the demographic role 
complexity in different species. 

For species with multiple alarm call types, different 
call types may be used to communicate about different 
types of predators [42,43,45], to communicate different 
degrees of risk or response urgency [46,48], or to com¬ 
municate both predator type and urgency [44,72]. 
Species living in more complex social groups, such as 
groups with more social demographic roles, may have 
greater need to signal alarm in a more complex 
manner and thus may use larger alarm call repertoires. 
Across ground-dwelling sciurids, alarm call repertoire 
size varies from one to five (see examples in figure 1). 
Blumstein & Armitage [39] used a combination of 
field recordings and literature review to establish the 
alarm call repertoire size for 22 species of sciurid and 
regressed this against the social complexity index. 
Demographic role complexity significantly explained 
variation in alarm call repertoire size, both in analyses 
of raw data and independent contrasts (figure 2). 

More up-to-date phylogenies have since been pub¬ 
lished, so we re-ran the independent contrasts analysis 
using more recent phylogeny taken from Herron et al, 
[70] and Steppan et al, [71]. The relationship between 
social demographic complexity and alarm call repertoire 
size remained significant (r = 0.228, y = 1.049x, p = 
0.025, n = 21 independent contrasts). An increase in 
social demographic roles appears to drive the ability to 
communicate via a larger, more complex repertoire of 
alarm calls. The functional explanation underlying this 
correlation is unknown. Future study of alarm call 
type use in different contexts or between different 
caller-receiver dyads may be necessary to untangle 
how more demographically complex species may benefit 
from a more diverse alarm call repertoire. 


4. SOCIAL GROUP SIZE DRIVES THE EVOLUTION 
OF INDIVIDUAL SIGNATURES 

Pollard & Blumstein [54] compared sciurid vocal signa¬ 
ture individuality (individual distinctiveness) against 


(yCC) 4 1 
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Figure 2. Relationship between social demographic complex¬ 
ity and alarm call repertoire size across 22 species of sciurid 
rodents, in raw (a) and independent contrasts (b) data. 
Social complexity (variability in demographic roles) correlates 
with alarm call repertoire size, (a) Adapted from Blumstein & 
Armitage [39], with permission from Chicago Journals. 
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Figure 3. Relationship between social group size and alarm call 
individuality across eight species of sciurid rodents. In raw (a) 
and independent contrasts (b) data, social group size correlates 
with vocal individuality (H s ) in sciurid alarm calls (r 2 > 0.88, 
p < 0.001 in both cases). Species in (a) are Cynomys leucurus 
(CYLE), C. ludovicianus (CYLU), Marmota flaviventris 
(MAFL), M. olympus (MAOL), Spermophilus beecheyi 
(SPBE), S', beldingi (SPBL), S. richardsonii (SPRI) and 
S. tridecemlineatus (SPTR). Figure adapted from Pollard & 
Blumstein [54], with permission from Elsevier. 
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Figure 4. Possible functional relationships between attributes of social complexity and attributes of communicative complexity. 
For specific examples of relationships that are supported in the literature, see table 2. 


social group size, Blumstein & Armitage’s [39] demo¬ 
graphic complexity index, and two other indices of 
social structure complexity [61,62]. Group size was 
not correlated with these other complexity indices 
[54]. Because studies quantifying vocal individuality 
are not common, this analysis required novel data, and 
eight species of ground-dwelling sciurid were thus cap¬ 
tured, individually marked and recorded on multiple 
occasions. Individuality was calculated from acoustic 
traits using an information-theory metric developed 
by Beecher [5,73] and used in previous studies 
[25,27,52,74]. The individuality information statistic 
quantifies, in bits, the amount of individually specific 
information content present in a species’ vocalizations. 
This statistic was used as the quantitative metric of 
one attribute of communicative complexity. 

Individual signatures in alarm calls are important 
for animals such as sciurids [75-77], Individuals 
vary in their alarm signal reliability, and listeners 
benefit when they recognize the individual identity of 
an alarm caller, as this allows them to better calibrate 
their behavioural response [76-80]. Ground-dwelling 
sciurids live in groups of closely related kin, such 
that the signaller and receiver are often close relatives. 
This may allow callers to benefit via kin selection by 
aiding related receivers in individual discrimination 
and reliability assessment [54,77], although they may 
also benefit via reciprocal altruism [81]. 

Individuality is expected to evolve with group size 
[5,25-27,52,54,73]. As social group size increases, 
the number of individuals that must be discriminated 
increases accordingly, making individual recognition 
tasks more difficult. Increased individuality would be 
necessary to permit successful discrimination of all 
the individuals in the group. Since other attributes of 


sociality affect communicative complexity [39,82], 
these other attributes may also influence individuality. 

Pollard & Blumstein [54] gathered typical social group 
size data from the literature and regressed this against the 
individuality metric. Group size explained considerable 
variation in vocal individuality, both in the raw data 
and in an analysis using independent contrasts 
(figure 3). As predicted, social group size appears to 
drive the evolution of individual signature information. 

Interestingly, it is group size, not other social 
complexity attributes, that seems responsible for the 
evolution of individually specific vocalizations. Signa¬ 
ture information was also regressed against three other 
measures of social structure complexity, including 
Blumstein & Armitage’s [39] demographic complexity 
index, Michener’s [62] social grade, and Armitage’s 
[61] sociality index. ‘Social grade’ and ‘sociality index’ 
are somewhat subjective numerical assignments 
intended to quantify the complexity of each species’ 
social structure and mating system. Individuality was 
not significantly related to any of these three other 
social complexity attributes, even after controlling for 
variation in group size. 

5. ATTRIBUTES OF SOCIALITY, ATTRIBUTES 
OF COMMUNICATION 

Results from sciurid studies highlight an important 
caveat about the evolution of social and communicative 
complexity. Both sociality and communication have 
multiple attributes (table 1), and these attributes affect 
one another in complex ways (figure 4). The main 
insight from our integrative overview is that different 
attributes of social complexity are likely to drive differ¬ 
ent attributes of communicative complexity. Thus, it is 
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important to think clearly about the nature of the 
relationship because different attributes work different 
ways (see also [1]). Studies from other taxa also support 
taking a nuanced look at the correlated evolution 
of social and communicative complexity (table 2). For 
example, the complexity of reproductive roles was 
found to influence individuality in wasp facial markings 
[82], while nesting or roosting colony size was found to 
influence individuality in avian and chiropteran contact 
calls [25-27]. In phocid seals, underwater repertoire 
size increased with mating system complexity [88]. In 
primates, social group size and time spent in social 
grooming influenced vocalization repertoire size [32], 
while group size influenced ability to produce varied 
facial expressions [35]. Furthermore, social complexity 
may influence not just the complexity of communica¬ 
tive signals themselves but also the perceptual capacity 
to receive such signals, e.g. social group size predicts 
auditory sensitivity in lemurs [84] . Social and commu¬ 
nicative complexity can also covary within species or 
on short time scales; for example, chickadees placed in 
larger groups displayed more variability in their call 
types and combinations [40] . In all these cases, a differ¬ 
ent attribute of social complexity is associated with a 
different attribute of communicative complexity. 

As a general rule, we would predict that the types of 
communicative complexity that are most relevant to a 
specific aspect of sociality would be evolving with 
those aspects of sociality. For example, if individual 
recognition is important, and if all members of a 
group must be recognized, we would predict a specific 
type of communicative complexity (individualistic sig¬ 
natures) to evolve with a particular attribute of social 
complexity (group size). This has been found in birds, 
bats and sciurids [25-27,54] . If calls are used to attract 
or impress mates, we would predict specific attributes of 
communicative complexity (such as syntactical com¬ 
plexity or repertoire size) to evolve with a specific 
attribute of social complexity (mating system). Indeed, 
male repertoire size seems to have evolved with mating 
system in phocid seals [88]. However, these relation¬ 
ships are not always obvious, and attributes of sociality 
may influence communication in diverse and compli¬ 
cated ways (figure 4). Future studies in more taxa will 
be necessary to comprehensively identify the many 
ways in which distinct attributes of social and communi¬ 
cative complexity are evolutionarily or functionally 
linked. With the discovery that social network statistics 
can be used by behavioural biologists [18,91,92], 
there are many precisely defined attributes of sociality 
that could be studied. From a comparative perspective, 
available data are often a limiting factor, but over time, 
there will be more data from different species available 
with which to study the evolution of social and com¬ 
municative attributes [93] . As these databases are 
developed, sciurid rodents can continue to play a 
valuable role in this pursuit, because these species vary 
in several attributes of social and communicative 
complexity, and systematic study can help identify 
which of these attributes covary. These attributes can 
be further examined with respect to different environ¬ 
mental conditions in sciurids’ diverse habitats, and/or 
mapped onto available phylogenies to estimate when 
and how complexity evolved. 


6. CONCLUSIONS 

The relationship between social complexity and com¬ 
municative complexity is multi-faceted. Different 
attributes of sociality may drive the evolution of differ¬ 
ent attributes of communication. For example, in 
sciurid rodents, social demographic complexity 
explains the evolution of alarm call repertoire size, 
while social group size explains the evolution of 
alarm call vocal individuality. Sciurids are one of 
many excellent model systems in which to explore 
the evolution of social and communicative complexity, 
and future work in sciurid and non-sciurid taxa will 
help build a comparative database in which these ques¬ 
tions can be thoroughly examined. The ever-growing 
comparative database will ultimately help us develop 
a comprehensive understanding of how social 
complexity and communicative complexity evolve. 
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Abstract 

This article presents the integration of brain injury biomechanics and graph theoretical analysis of neuronal connections, or 
connectomics, to form a neurocomputational model that captures spatiotemporal characteristics of trauma. We relate 
localized mechanical brain damage predicted from biofidelic finite element simulations of the human head subjected to 
impact with degradation in the structural connectome for a single individual. The finite element model incorporates various 
length scales into the full head simulations by including anisotropic constitutive laws informed by diffusion tensor imaging. 
Coupling between the finite element analysis and network-based tools is established through experimentally-based cellular 
injury thresholds for white matter regions. Once edges are degraded, graph theoretical measures are computed on the 
"damaged" network. For a frontal impact, the simulations predict that the temporal and occipital regions undergo the most 
axonal strain and strain rate at short times (less than 24 hrs), which leads to cellular death initiation, which results in damage 
that shows dependence on angle of impact and underlying microstructure of brain tissue. The monotonic cellular death 
relationships predict a spatiotemporal change of structural damage. Interestingly, at 96 hrs post-impact, computations 
predict no network nodes were completely disconnected from the network, despite significant damage to network edges. 
At early times U<24hrs) network measures of global and local efficiency were degraded little; however, as time increased 
to 96 hrs the network properties were significantly reduced. In the future, this computational framework could help inform 
functional networks from physics-based structural brain biomechanics to obtain not only a biomechanics-based 
understanding of injury, but also neurophysiological insight. 
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Introduction 

The finite element method is often used to study neurotrauma 
[1-9] and continues to emerge as a useful tool in the field of 
neuroscience [10-13]. Models continue to advance in biofidelity 
by incorporating an increased level of anatomic detail [4,13,14], 
improved representation of the material behavior at various 
loading rates [15-20], and advanced measures and predictions of 
injury [8,21,22]. Finite element models are commonly used to 
understand the biomechanics of brain and skull deformation when 
the head is subjected to insult, leading to improved insight into 
mechanisms of acute injury. For example, modeling axonal injury 
mechanisms within white matter of the brain has been the focus of 
some recent efforts and provides a means to relate an insult to a 
cellular injury mechanism [8]. By using diffusion tensor imaging 
(DTI) fiber tractography, the structural orientation of neuronal 
axonal bundles can be incorporated into the finite element model 
and can be used to compute the axonal strain during brain white 
matter deformation [8,23]. Then, by using an axonal injury 
threshold, the occurrence of diffuse axonal injury (DAI) is 


predicted [8]. Wright and Ramesh [8] show that the degree of 
injury predicted is highly dependent on the incorporation of the 
axonal orientation information and the inclusion of material 
anisotropy into the constitutive model for white matter. By 
modeling the underlying mechanism of DAI, an enhanced 
understanding of the neurotrauma is attained through a spatio- 
temporal description of tissue deformation. In general, finite 
element simulations may help to elucidate the injury mechanisms 
of neurotrauma. 

As finite element models advance, experimentally based models 
of neurotrauma also continue to become more sophisticated, 
ranging from the macroscopic [24,25] to the cellular level [26-29]. 
Various biomechanical and physiological injury thresholds for 
neurotrauma have been proposed in the past, including intracra¬ 
nial pressure [21] and strain [8,30,31]. While these thresholds offer 
an immediate prediction of injury, they lack a long-term 
description of functional degradation. A time-evolving injury 
model is attractive since the biological response occurs on a slower 
time scale than an injurious stimulus as a consequence of 
mechanotransduction cascades [27]. There have been significant 
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Author Summary 

According to the Centers for Disease Control and 
Prevention in the United States, approximately 1.7 million 
people, on average, sustain a traumatic brain injury 
annually. During the last few decades, brain neurotrauma 
biomechanics has been an active area of research 
involving medical clinicians and a broad range of scientists 
and engineers. In addition, advances and fast growth of 
human connectomics continues to reveal new insights into 
the damaged brain. With recent advances in computa¬ 
tional methods and high performance computing, we see 
the need and the exciting possibility to merge brain 
neurotrauma biomechanics and human connectomics 
science to form a new area of investigation - connectome 
neurotrauma mechanics. For neurotrauma, the idea is 
simple - inform human structural connectome analysis 
using physics-based predictions of biomechanical brain 
injury. If successful, this technique may be further used to 
inform human functional connectome analysis, thus 
providing a new tool to help understand the pathophys¬ 
iology of mild traumatic brain injury. 

efforts to develop empirically based time-evolving cellular injury 
thresholds, which many times use in vitro cellular and tissue 
culture models. Morrison III et al. [27] suggest that for brain 
biomechanics, neuronal culture models that accurately mimic 
specific brain features can be used to explore tissue properties and 
tolerances or thresholds to mechanical loading. Cell death has 
been primarily used as a definition of injury within the neuronal 
culture model community and has been applied to determine 
tissue-level tolerance criteria using local values of axonal strain, 
strain rate, and time from “insult” [32,33]. Furthermore, as 
Morrison III et al. [27] point out, empirical functions for cellular 
death based on culture models could be incorporated into finite 
element analyses, thereby enabling biological predictions to 
supplement mechanical predictions of local tissue stress and strain. 
In the study presented here, this concept is further explored and 
used as a bridge to a network-based analysis of the brain. 

As physics-based models become more capable of predicting 
tissue-level injury mechanisms from improved computational and 
experimental resources for biomechanics, there remains a need to 
understand how structural damage in a given location of the brain 
evolves, and how it may influence functional or cognitive 
performance over time. Such a goal is complex and difficult. For 
example, as Kaiser et al. point out [34], in some instances the 
brain can be robust to physical damage, and in other instances 
physical damage can cause severe functional deficits. Nevertheless, 
Kaiser et al. [34] pursue the important and unanswered question: 
Are the severity and nature of the effects of localized damage 
predictable? To help explore the answer to this question, tools are 
being developed for the quantitative analysis of brain network 
organization, based largely on graph theory [35,36]. Typically, 
network nodes represent brain regions, often obtained from high 
resolution magnetic resonance imaging (MRI). The network links 
or edges between brain regions represent interregional pathways 
that convey neuronal signals and are commonly obtained from 
non-invasive DTI or diffusion spectrum imaging (DSI) [37,38]. 
The connection matrix of the network of the human brain forms 
the so-called “human connectome” [35,36,39,40]. Recently, Jirsa 
et al. [41] used connectomics to establish a framework for a 
“virtual brain”, in which network modeling is used to understand 
the intact and damaged brain. Similar to previous work, 
hypothetical or random deletion of nodes or edges were used to 


degrade the structural connectome [42]. More recently, structural 
and diffusion images from 14 healthy subjects were used to create 
spatially unbiased white matter connectivity importance maps that 
quantify the amount of disruption to the overall brain network that 
would be incurred if that region were compromised [43]. In this 
study, we attempt to extend the capabilities of neurocomputational 
models by providing a physics-based approach for predicting 
degraded regions of interest. Physics-based injury predictions may 
help inform structural connectome analysis. 

In this study, neurotrauma is investigated by using finite element 
simulations of a single individual subjected to a simulated head 
impact. Tissue damage is computed using empirically based 
damage models that provide a link from macroscopic biomechan¬ 
ical deformation to mesoscopic damage. Axonal bundle tracts are 
explicitly modeled using a multiscale description of white matter 
tracts obtained from diffusion tensor imaging. Then, using the 
physics-based injury predictions for white matter tissue from finite 
element simulations, the structural brain connectivity or con¬ 
nectome is degraded, and various network measures are comput¬ 
ed. This is an important contribution because finite element 
simulation predictions of tissue damage provide physics-based 
reasoning for removing nodes or degrading edges to create the 
“damaged” brain connection matrix. In turn, this approach may 
provide further insight into mild traumatic brain injury by 
shedding light on the relationship between mechanical stimulus 
to the brain and neurobiological processes that result. Further¬ 
more, if successful, the computational framework presented herein 
could supplement ongoing efforts to evaluate the use of non- 
invasive medical imaging tools, such as diffusion tensor and 
spectrum imaging, to detect white matter disruption for neuro¬ 
trauma diagnostics [44,45] by providing a time-evolving history of 
tissue injury. 

Methods 

A suite of medical imaging and software tools are used to obtain 
an individual-specific finite element model and structural con- 
nectome-based analysis. The overarching process is schematically 
shown in Figure 1 and is outlined below. 

Individual Specific Models 

T1 and diffusion tensor magnetic resonance images are taken 
from a single individual (the corresponding data can be found in 
[37]). The T1 image (Figure la) is segmented into different head 
materials (Figure lb) using the software Amira [46] and the 
Connectome Mapper Toolkit [47]. The segmented geometry is 
then used to create a biofidelic three-dimensional finite element 
volume mesh (Figure lc). In order to create a corresponding 
structural connectome or network, the T1 image is parcellated into 
83 regions of interest (ROI) representing the location of 
anatomical regions of the brain based on the Desikan-Killiany 
atlas extended to include subcortical regions [48]. Since diffusion 
tensor images (Figure 1 e) are used to generate axonal bundle fiber 
tractography based on the direction of peak water diffusion in each 
voxel [49], the DTI fiber tractography (Figure If) represents the 
approximate location of neuronal axonal bundles [50]. Fibers are 
filtered for connectome creation to include only fibers that begin 
and end within ROI. The structural connectome is assembled 
using the Connectome Mapper Toolkit [47] and is composed of 
nodes representing ROI generated at the centroid, connected by 
edges that represent structural pathways for which the DTI 
tractography traverses. Following the segmentation enabled 
by the Desikan-Killiany brain atlas, there are 83 network nodes 
and 1029 network edges. Furthermore, DTI fiber tractography is 
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Dynamic Finite 
Element Analysis 


Segmentation and Parcellation 


Tractography 




Figure 1. A flowchart of the process for creating the finite element model and connectome from T1 and diffusion MRI. (a) The T1 

image is (b) segmented into different head materials. The segmented geometry is then used to create (c) a biofidelic three-dimensional finite element 
volume mesh. The mesh is required for (d) an explicit dynamic finite element simulation that captures the biomechanical response from frontal 
impact, (e) Diffusion tensor images are used to generate (f) axonal bundle fiber tractography which is used to inform finite element transversely 
isotropic constitutive descriptions of white matter tissue behavior (c). Tractography is also used to create (g) a network model of the brain that can be 
(h) degraded over time. 
doi:10.1371/journal.pcbi.1002619.g001 
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incorporated into the finite element model by using a transverse 
isotropic material model specifically developed for representing 
white matter tissue [23,51] where each finite element within white 
matter regions is assigned an orientation based on the superpo¬ 
sition of fiber tractography [52]. Details of the numerical 
implementation are published elsewhere [52], but it is important 
to point out that various different cases needed to be considered 
while assigning one orientation per finite element. For example, in 
the case of multiple fibers that overlap the spatial bounds of a 
single element, the multiple liber orientations were averaged. 


That is, Elkin et al. [33] found that cortical cell death was 
dependent on applied strain rate, whereas hippocampal cell death 
was not. The relations used in the present model, obtained from 
Morrison III et al. [27], are: 


D cai / cas 0.0389 


0.36631 
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Damage and Injury Thresholds for Coupling FEM Results 
to the Connectome Analysis 

Herein, finite element simulations of the human head are 
designed to mimic experimental conditions for cadaveric impact 
tests, which are conducted to understand the dynamics of a frontal 
impact and the associated compression-tension damage [53]. In 
this study, the explicit dynamic finite element method [54] is used 
to capture the transient response of head impact. Supplementary 
information about the finite element method is included in the 
supporting information, Text SI. In addition to a volume mesh for 
the head, the finite element model also requires material 
constitutive descriptions and properties for all its components: 
skull, cortex, brain stem, cerebrospinal fluid, and soft tissue, which 
represents a homogenized mixture of muscle and skin. A detailed 
list of the material constitutive laws used to relate tissue strain to 
stress is included in the supporting information, Text S2. Once 
again, we point out that diffusion tensor imaging is used to inform 
the constitutive model for white matter tissue, which has been 
applied in the past for studying non-human primates [31] and 
human injury [8,22,55]. The entire finite element model consisted 
of 1,394,945 tetrahedral elements and 237,115 finite element 
nodes. For boundary conditions, the bottom of the neck is fixed, 
and a force was applied to a circular area on the forehead (about 

r\ 

3 mnr) in the anteroposterior direction. The input force-time 
curve was a sinusoidal shape with a peak force of 7,000 N at 
2.75 ms. The finite element simulation computes the time-evolving 
mechanical strain and stress in the direction of axonal fiber 
bundles. The strain in the direction of axonal bundles is referred to 
as the axonal strain. 

One limitation of the current finite element model is the 
exclusion of viscoelasticity in the constitutive description of brain 
matter. The authors acknowledge that to accurately model the 
progression of damage, the constitutive model should be extended 
to account for the time-dependent behavior of brain tissue. The 
exclusion may have an effect on the outcome of our results, leading 
to larger shear stresses, but smaller shear strains, thus, less 
predicted damage. For example, Chafi et al. [56] shows that 
viscoelasticity plays a major role in the dynamic response of the 
brain under blast loading. Future efforts are focused on improving 
the mechanical description of brain tissue. 

In order to model damage using a physics-based approach, 
either an explicit failure mechanism should be modeled or an 
empirically based failure threshold is required. For this study, 
measures of axonal strain and strain rate computed for white 
matter regions are used as input for empirically based injury 
threshold predictions that are obtained from cellular culture 
experiments. Specifically, experimental results for cellular death 
are described using a mathematical function for tolerance criteria 
that relates strain to resultant cell death evaluated for up to four 
days post-injury [27,32,33]. Experimental data exists for the CA1 
and CA3 regions of the hippocampus, dentate gyrus, and cortex 
for the rat. Experiments suggest that some regions of the rat brain 
are sensitive to loading rate, while other brain regions are not. 


D Cortex = 0.094 


r 1.5293] 
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[• 0.11751 

O 



where t is time from insult, £ is the local strain, and £ is the local 
strain rate [27]. Similar to Morrison III et al. [27], the units of 
time are days, strain is dimensionless and strain rate is inverse 
seconds. The damage parameter, D, is defined as the percent area 
of cell death. Using Equations 1-3, the axonal strain and strain 
rate from the finite element simulations, as well as time, are used to 
calculate percent cell death for a tissue region. Due to insufficient 
resolution of the MRI data used in this study, segmentation of the 
hippocampus into CA1, CA2 and dentate gyms regions was not 
possible, thus the more conservative equation for D^q was used to 
compute cell death for the entire hippocampus. Equations 1-3 are 
monotonic functions that only increase with time, thus cellular 
repair mechanisms and regeneration are not currently captured. 
Potential issues with monotonic cellular death predictions, as well 
as using rat brain injury thresholds instead of human cellular 
injury thresholds, will be discussed later. Since the explicit 
dynamic finite element method is used, the transient wave 
propagation for solid mechanics is resolved; however, this is 
computationally costly and limits the total time of biomechanical 
prediction. The explicit dynamic simulation runs to 15 ms, thus 
the “long-term” structural mechanics of brain swelling, relaxation, 
etc., are not captured in the current model, although could be 
adapted in the future by using quasi-static, implicit finite element 
solvers. 

The reader should understand that we use the local tissue strain 
and strain rates predicted from finite element simulations of short 
duration, about 15 ms, as input to experimentally based cellular 
death models that were developed over a 96 hour period. This 
assumes that the tissue damage due to large deformations occurs 
immediately and initiates an injury process that grows with time. 
This assumption is based on the observation that cell death was 
not immediate in response to deformation, but instead, increased 
over 4 days after injury [27]. 

It should be noted that the cellular death estimates that 
Morrison III et al. [27] developed were not based on axonal strain 
but instead on nominal strain applied to the back of the substrate 
on which the neuronal cells were attached. Thus, Equations 1-3 
may not accurately describe the actual relationship between 
axonal strain and injury. Furthermore, the neuronal cell bodies in 
the experiments were not aligned in a specific orientation, so the 
response is not strictly representative of axonal strain injury. There 
are research efforts examining stretching of individual axons but 
have not proposed empirical relationships for cellular death in 
terms of applied strain, rate of loading and time from insult 
[57,58]. Experimental measurements of the cellular response of 
white matter, especially axonal bundles, would be interesting to 
explore and could provide a more accurate representation of cell 
damage in the future. 

Furthermore, the experiments performed by Morrison III et al. 
[27] used the rat hippocampus, which is mostly comprised of gray 
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matter. Thus, it should be noted that the empirical data described 
in Equations 1-3 was not intended to describe white matter injury 
response so there may be limitations in applying Equations 1-3 to 
predict cellular white matter injury. Cellular injury threshold data 
for isolated white matter is currently limited; however there have 
been some efforts to characterize mechanical damage to axons 
[57-59]. We hope a computational framework described here will 
help to motivate efforts to obtain different white matter and gray 
matter empirical functions, which could then be used for each of 
region of the brain separately. As mentioned before, the 
application of this approach raises the possible need for 
understanding properties of white matter fiber bundles. 

In order to map the finite element results to network-based 
analysis tools, output data from each finite element that represents 
white matter is mapped to a corresponding voxel in the MRI data 
that is used to create the DTI tractography. This mapping is 
referred to as the element-to-voxel map. Multiple finite elements 
within a single voxel are averaged. The element-to-voxel map 
enables voxels to be assigned additional data, including axonal 
strain and strain rate from the finite element simulation. Alstott et 
al. [42] chose to generate brain lesions by altering the structural 
connectivity matrix of the brain by deleting nodes using various 
methods. In the present work, instead of deleting nodes, we 
degrade the edges of the network based on the computed cellular 
death at the voxel level. 


To understand how the structural network is degraded, consider 
the schematic shown in Figure 2. Edges in the network are 
constructed from voxels that connect two different ROIs. The 
edge strength is relative to the number of tracts between two ROI. 
Cellular death for each voxel is computed using Equations 1-3 
and may grow to reach the chosen critical value of cellular death, 
D c (discussed shortly). Voxels with a predicted cellular death 
greater than D c are shown in red in Figure 2. For this study, if a 
tract traverses a voxel that is greater than the threshold, the entire 
tract is considered damaged, thus the edge strength is decreased. 
This procedure is similar to that used by Kuceyeski et al. [43] who 
simulated lesions at each voxel and removed tracts passing through 
the lesion in order to create a damaged network to enable analysis 
of changes in network measures and the evaluation of the 
importance of each voxel. Similar to Honey et al. [60], the 
connection strengths were resampled to a Gaussian distribution 
with a mean of 0.5 and a standard deviation of 0.1. As Alstott et al. 
point out [42], this transformation does not alter the rank-ordering 
of strong to weak pathways, but simply compresses the scale of 
connection strengths. There are other possible ways to degrade the 
structural network that will be discussed later. 

Since Equations 1-3 predict some degree of cell death for non¬ 
zero values of axonal strain, strain rate, and time, the additional 
critical value of cell death, D c , was required as a rule in order to 
degrade a voxel. The critical cellular death of D c = 3% was chosen 
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Figure 2. Schematic showing how structural network edges are degraded over time. Red voxels indicate that the chosen critical cellular 
death threshold, D c , is reached. For this study, DTI tracts that traverse a damaged voxel are removed, thus degrading the connection strength of the 
network edge. 
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because it predicted similar levels of damage as compared to other 
proposed thresholds, including 18% axonal strain, which was used 
previously as a threshold that indicated degradation in electro- 
physiological function [30]. During the calculation of cellular 
death, positive values of strain and strain rate were used (negative 
strain was not used to calculate cellular death). The choice of 
injury threshold, as well as the micromechanics of axonal fiber 
bundles, is an active area of research that would assist making the 
current methodology more accurate in the future. During the 
15 ms dynamic simulation, strain and strain rate data for each 
voxel is output at 0.1ms increments. Cell death is calculated at 
each voxel for each of these increments. Then, the maximum cell 
death value calculated in each voxel is used to predict cell death up 
to 96 hrs. It is important to note that cell death calculations are the 
result of the combination of variables at each increment rather 
than considering each variable independently. 

Results 

Finite Element Analysis of Head Impact 

The deformed configurations of the head, along with contours 
and response curves for various locations within the brain (frontal, 
parietal, occipital, temporal, corpus callosum and cerebellum) are 
shown in Figures 3a—f. Output variables including pressure, axonal 
strain, and effective strain rate, useful for understanding the 
anisotropic biomechanical response for white matter, are shown. 
Confidence in the finite element model is established by 
comparing output from the computations to pre-existing experi¬ 
mental data on cadaveric head impact [53]. The values of axonal 
strain and strain rate were taken from specific locations in each of 
the six regions plotted in Figure 3. Four out of six of the locations 
represented the locations of the pressure transducers for the 
experiments of Nahum et al. [53]. Locations within the cerebellum 
and corpus callosum were also added for the strain and strain rate 
analysis. A variation of results will exist in each region, and taking 
an average of values in a anatomical region could help resolve this 
problem, but including simulation data points that are located 
farther from the experimental sampling points could also make the 
results less accurate. The head impact simulation took about 
30 hours on 32 processors in order to reach 15 ms. 

As seen in Figure 3b, the intracranial pressure quickly increases 
to positive values in the frontal and parietal regions, while quickly 
increasing to negative pressures in the occipital and posterior fossa 
regions. The anteroposterior pressure gradient (seen in Figure 3a) 
is commonly observed in experimental and computational studies - 
giving rise to the so-called coup and contrecoup loading scenario, 
for which there are an associated number of proposed injury 
mechanisms. Short duration intracranial pressure gradients with 
high positive pressures are observed at the coup region, with 
negative pressures at the contrecoup region. The maximum 
positive pressure of approximately 160kPa is reached in about 
2.5 ms in the frontal lobe, closest to the impact, while the 
maximum negative pressure of approximately — 80 kPa is reached 
in about 2.0 ms in the posterior fossa region. The computed 
pressure response is directly compared to Nahum et al.’s [53] 
experimental results in Figure 3b and show similar trends. In 
addition, validation of the strain response against cadaveric 
experiments of Hardy et al. [61] is also described in the 
Supplemental Text S3. 

Axonal strain at the various brain regions responds slower than 
the pressure response. The axonal strain begins to substantially 
grow at 1 ms and shows a gradual rate of change of strain, with a 
maximum of 33% at 12 ms over the duration of the simulation 
within the temporal region. Unlike the pressure, an obvious 


transcranial gradient is not apparent for the axonal strain. While 
the frontal region had the highest predicted pressure, the temporal 
and occipital brain locations have the largest values of axonal 
strain in the regions specifically examined. Later, in the structural 
network analysis, these areas are associated with the largest 
amount of cellular death. Interestingly, if a threshold of injury of 
18% axonal strain is chosen [8,30], our results show the onset of 
injury occurs at approximately 9.1 ms within the temporal lobe, 
despite the intracranial pressure reaching approximately 100 kPa 
in 2 ms. The time scales at which the pressure and strain grow will 
be discussed later in the context of injury cascades. 

The effective strain rate, also commonly referred to as rate of 
loading or loading rate, has a maximum value of approximately 
85 s“ 1 in the temporal lobe. From the contours shown in Figure 3e 
there appears to be a strain rate focusing, with lower strain rates 
closer to the skull and higher values more central to the brain. 
Shear strain focusing has been reported earlier [3,62] and is 
attributed to the partial conversion of energy of the axial impact 
into a shear mechanical stress wave as a result of the material 
response of the brain, cerebrospinal fluid, and skull [62]. The 
maximum strain rate is observed before maximum axonal strain 
because the axonal strain accounts for magnitudes of deformation, 
while the strain rate relates to the rate of change of strain. 

FEM Informed Structural Connectome Analysis 

The axonal strain and strain rate output from the finite element 
simulations are used to compute the amount of cellular death, 
according to Equations 1-3. Figure 4 shows the evolution of 
damaged tractography up to 96 hrs post-impact, using a critical 
cellular death of D c = 3% as a threshold for injury and the 
corresponding evolution of the degraded structural network for 
sagittal and transverse views. The edges in the network, which are 
fully damaged, are shown in red for visualization. In reality, each 
edge is weighted and has degraded values before it is fully 
damaged. The network nodes are scaled by the percentage of 
connections that were removed, so that larger nodes have lost 
more connections compared to original values (1 — [Degree new 
/Degree or i g i na i]). The sagittal and transverse views are shown to 
provide insight about where the damage is predicted and how it 
progresses through time. At 24 hrs, the top four regions affected 
include the cuneus (medial surface of left cerebral hemisphere in 
the occipital lobe), fusiform gyrus (temporal lobe), lingual gyrus 
(occipital lobe), and peri-calcarine. Together, damage in these 
brain regions have 559 tractography fibers removed from network 
edges. Note that the total number of fiber tracts prior to impact 
was 497,442, so this damage corresponds to a 0.011 % degradation 
of tractography. The percentage of fully degraded edges for 24, 48, 
72, and 96 hrs are listed in Table 1. Using an 18% strain criterion, 
17.3% of the edges were fully degraded. 

Figure 4b shows that structural damage occurs first in left 
cuneus and the right superiotemporal regions, resulting in one fully 
damaged edge at 24 hrs. The edge between the two regions had 
only three tract fibers with less than one voxel or 2 mm between 
them. The edge also had a high mean fiber length of 131.7mm 
(the original total mean fiber length was 81.7 mm). This suggests 
that network edges associated with few tract fibers and long fiber 
length are more susceptible to damage. Asymmetry of the 
predicted damage occurs due to the asymmetry of the underlying 
anatomy associated with the finite element mesh, as well as 
potential asymmetry of the fiber tractography, which has been 
studied in recent work [63]. At 48 hrs, predicted damage seems to 
show a high density of damaged fibers in the anteroposterior 
direction in both hemispheres of the brain. Lateral tractography 
damage is also predicted within the corpus collusum. As time 
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Figure 3. Orthographic view of the local three-dimensional response measured at various locations in the brain for (a-b) pressure, 
(c-d) axonal strain and (e-f) strain rate predicted using a finite element simulation for impact to the head. Prediction of the intracranial 
pressure response is compared to cadaveric experiments [53] and is shown in (b). 
doi:10.1371/journal.pcbi.1002619.g003 


progresses, fully damaged rostrocaudal tracts are predicted and 
continue to increase. 

Structural changes to the tractography and resulting network 
arise because of the underlying voxel condition. That is, if a tract 


goes through a voxel that has reached the critical cellular death 
value, D c , the tract is removed. Therefore, it is useful to examine 
how the inherent voxel properties influence results. Figure 5a 
shows the distribution of voxels above the predicted critical cellular 
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Sagittal View 


Transverse View 


96 hrs 


96 hrs 


48 hrs 


72 hrs 


Figure 4. Evolution of damaged tracts and the corresponding structural networks. Using empirically-based cellular death predictions 
obtained from in vitro models of neural tissues, local strain and strain rate values computed from finite element simulations are used to specify injury. 
A computed cellular death of 3% was used as a critical value for defining white matter disruption. Damage is shown in red and the node size 
represents the percent change of degree. The predicted evolution of damage is shown for the sagittal and corresponding transverse views for 24 (a 
and b), 48 (c and d), 72 (e and f), and 96 hours (g and h). 
doi:10.1371/journal.pcbi.1002619.g004 


death of 3% as a function of angle between the axonal fiber bundle 
direction and the direction of the head impact (i.e., frontal impact 
on the anteroposterior axis). If the tract fiber direction within a 


Table 1 . The percentage of fully degraded edges and 
percentage of voxels above the 3% threshold for 24, 48, 72, 
and 96 hrs post-injury. 



Post-Injury (hrs) 

% Fully Damaged 

% Voxels Above 


Edges 

3% Threshold 

24 

0.097 

0.008 

48 

7.19 

1.1 

72 

14.1 

2.7 

96 

19.7 

4.2 


doi:10.1371 /journal.pcbi.1002619.t001 


voxel is parallel with the impact direction, the angle is zero. This 
shows the distribution of damage as a function of angle with respect 
to the impact direction. Because of the high degree of mechanical 
rotation observed through the shear focusing in the temporal and 
occipital brain regions, tract fibers with large angles with respect to 
the impact direction are damaged initially. This is seen in Figure 5a 
at 24 hrs (see red bars in plot) and in Figure 4a-b. 

This data can be further analyzed by normalizing the number of 
damaged voxels for a given orientation by the total number of 
undamaged voxels with a given orientation from the impact 
direction. This measure is referred to as the angle-normalized 
number of damaged voxels. The distribution of angle-normalized 
number of voxels is shown in Figure 5b, and is important because 
it takes into account the original distribution of the number of tract 
fibers eligible to be damaged with respect to the impact direction. 
For example, Figure 5a shows 18 voxels above the 3% cell death 
threshold at 48 hrs; thus, one might assume that axonal bundles 
oriented 0—10° from the impact direction have little importance. 
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Direction And Direction of Impact (deg) Direction And Direction of Impact (deg) 

Figure 5. Distribution of (a) the number of voxels with predicted cellular death above 3% for each day as a function of angle 
between axonal fiber bundle direction and direction of impact to the head, and (b) the number of voxels with predicted cellular 
death above 3% for each day as a function of angle between axonal fiber bundle direction and direction of impact to the head 
normalized by the initial distribution of fiber angle with respect to the loading condition. 

doi:10.1371/journal, pcbi.1002619.g005 


However, when the original number of voxels available to be 
damaged is taken into consideration (Figure 5b), 11—20° angles 
have the highest percentage of damage at 48 hrs. In other words, 
depending on the underlying white matter microstructure, i.e., the 
axonal bundle orientation, with respect to the direction of impact, 
the node, and edge degradation in the structural network may be 
affected differently. This is directly related to the structural 
mechanics (as opposed to structural connectomics) of the 
underlying constitutive or material law used within the finite 
element simulation. 

However, because a monotonic function is used to describe the 
empirical cellular death prediction, this trend becomes more dilute 
as time progresses (but should not be extrapolated past 96 hrs 
since Equations 1-3 are not validated beyond that time). From 
Figure 4g—h, at 96 hours the predicted damaged axonal pathways 
can be seen in many directions and across all white matter brain 
regions indicating diffuse structural degradation. To summarize, at 
24 hrs fibers in the areas of large rotational tissue strain are 
susceptible, while at greater times, neuronal tracts that align with 
the direction of impact seem more susceptible to damage (using 
the assumed threshold of cellular death). In the future, additional 
empirical cellular death predictions that are non-monotonic (if 
valid), or cellular regeneration and repair models, would be useful 
to explore. 

Figure 6a-e shows the evolution of the structural connectivity 
strength matrices predicted as a result of head impact simulations. 
The original structural connectivity strength is defined as a 
Gaussian distribution, created by using a mean of 0.5 and 
standard deviation of 0.14 distributed over the total number of 
edges in the network. Each figure represents a snapshot in time 
starting with t = 0 and ending at 96 hours. Within the 96 hr period 
(for which the monotonic cellular death criteria are validated), as 
long as regions have non-zero strain and strain rate computed 


from the finite element simulation, edges in the network become 
degraded. This decline in network strength is evident in Figure 6e. 
Since a 3% critical cellular death value is only one possible injury 
criteria that could be chosen, an additional criterion was 
examined. Results using two different damage thresholds are 
shown in Figure 6e and f. Figure 6e is the connection matrix using 
the 3% threshold at 96 hrs, while Figure 6f is connection matrix 
using the 18% axonal strain threshold at /=15ms, as used in 
previous work [8,31]. Connection strengths show qualitatively 
similar trends in magnitude and location of edge degradation. 
While the two criteria offer a similar prediction of network 
damage, the cellular death Equations 1-3 are, perhaps, more 
useful because effects of strain, strain rate, and time have been 
decomposed into separate multiplicative terms that allow a 
compartmentalized study of extrinsic biomechanical conditions. 

In general, a network’s global efficiency represents how well- 
connected the network is compared to a perfectly connected 
network [64] and captures the network’s capacity for communi¬ 
cation along short paths [65]. Herein, values are reported as 
normalized global efficiency, which is the global efficiency of the 
network divided by the efficiency of an ideal network. Ideal 
network efficiency is calculated as a network where all nodes are 
connected at the minimum cost. The cost for each edge is defined 
as unity minus Gaussian strength. While strength offers a measure 
of the capacity to send information, the cost indicates the 
resistance to sending information - connections with high strength 
are low in cost. Local efficiency is a network measure that Latora 
[65] suggests helps to reveal the fault tolerance of the network 
system and shows how efficient communication between first node 
neighbors is. For this study, local efficiency is calculated for the 
same set of nodes used to make the local network of the 
undamaged network, even if a node loses its edge to the primary 
node. This method is used to provide a more direct comparison of 
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Figure 6. Connection strength matrices showing degradation over time, t. Connection strengths were resampled to a Gaussian distribution 
with a mean of 0.5 and a standard deviation of 0.1. Because of the monotonic cellular death criterion, as long as regions have non-zero axonal strain 
and strain rate from the finite element simulation, edges in the network eventually become degraded. The connection strength matrices at t = 0 hrs is 
shown in (a). The evolution of connection strength matrices for 0 (a), 24 (b), 48 (c), 72 (d), and 96 hours (e) are shown, as well as (f) the connection 
strength matrix for the case when a 18% strain threshold is used. 
doi:10.1371/journal.pcbi.1002619.g006 


the network after damage by accounting for edges that were 
removed. As a result, edges can no longer be part of a short path 
between nodes and cannot contribute to the efficiency, while also 
taking into account all nodes that should be a part of the local 
network without damage. 

The normalized global and mean local efficiencies as a function 
of time are shown in Figure 7. The normalized global efficiency is 
approximately 0.14 at / = 0hrs and is about the same at 24 hrs, 
indicating the network remains capable to send information. 
However, at 48 hrs an 8.8% reduction in normalized global 
efficiency is predicted and continues to reduce with time. A similar 
trend is observed in the mean local efficiency. At 96 hrs the 
normalized global efficiency was reduced 24%, while the mean 
local efficiency was reduced 27%. By 96 hrs, all but sixteen nodes 
had greater than 20% reduction of local efficiency. Also, note that 
there were no nodes completely disconnected from the network, 
although there were edges completely removed. The total number 
of edges at / = 0hrs was 1029; at / = 96hrs, 203 edges were 
removed using the cellular death threshold. Using an injury 
threshold of 18% strain, 161 edges were removed. 


Watts and Strogatz [66] define the small-world network based on 
the clustering coefficient of the network and the characteristic path 
length of the network. The clustering coefficient, C, is the fraction of 
triangles around a node and is weighted by the geometric mean of 
weights associated with edges of a triangle [64]. It measures how 
well the first neighbors of a node are connected to each other. The 
characteristic path length, L, is the average shortest path between all 
nodes of the network. To qualify as small-world, the networks 
clustering coefficient should be greater than that of an equivalent 
random network, whereas the characteristic path length should be 
approximately equal [66]. The small-world coefficient is defined as 
(C/C ran d)/(^/T ranc i) [64]. This value was found to be 1.87, 1.87, 
2.03, 2.31, and 2.64 for t = 0, 24, 48, 72, and 96 hrs, respectively. 
When using axonal strain as the threshold variable, a value of 2.38 
was computed. The increase seen in the small-world coefficient is 
due to the more rapid decrease in clustering coefficient of the 
random network compared to the damaged network that it is based 
on. In addition, the increase seen in the small-world coefficient also 
shows the ability of the network to maintain its modular structure 
more effectively than the random network that it was compared to. 
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Figure 7. Global and local efficiency in terms of time from impact. 

doi:10.1371 /journal, pcbi.1002619.g007 


The percent reduction of local efficiencies and the associated 
betweenness for the top 10 regions affected by impact at 96 hrs are 
listed in Table 2. Betweenness is defined as the number of times a 
node is part of a shortest path between nodes, and is one indication 
of a node’s network centrality that help to describe the 
“importance” of a node [64,65]. Hubs within the connectome 
are identified using betweenness centrality. The maximum 
betweenness of the original network at / = 0 is 226, while the 
minimum value is zero. Recall from Figures 4a—b and 5a, white 
matter disruption is observed in the left cuneus and the right 
superiotemporal regions, which has a betweenness centrality of 41 
and 17, respectively. The moderate to low betweenness of the 
regions first affected at 24 hrs helps to explain why the normalized 
global efficiency was not significantly reduced during this time and 
shows the importance of a robust brain network. At 96 hrs, the 
right hemisphere lateral orbitofrontal region shows the largest 
degradation in local efficiency of approximately 45.4%; however, 
that region also has a zero betweenness centrality. On the other 
hand, the right hemisphere medial orbitofrontal region has a 
betweenness of 110 and shows a 37.4% reduction of local 
efficiency, indicating it may have a more significant effect to the 
brain network. At 96 hrs, there are 10 network nodes with 
betweenness greater than 100 that have at least a 20% reduction of 
local efficiency. Local efficiency and betweenness for all ROIs are 
included in Supplemental Figures S2 and S3, respectively. 

The results reported thus far are based on a critical cell death 
value of D c = 3%, as well as a critical strain threshold of 18%. 
However, it is also useful to evaluate the sensitivity of the results to 
the choice of D c . An analysis of network properties was performed 
for multiple critical cell death values, in the range from 
D c = 2—10%, and are shown in Figure 8a-b. Figure 8a shows 
the percent reduction in total edge strength as a function of the 


critical cell death threshold choice. Figure 8b shows the percent 
reduction in global efficiency as a function of the critical cell death 
threshold choice. Results show that the choice of the critical 
threshold significantly affects both network measures that were 
examined. For example, for a change of D c between 3% or 4%, 
the reduction in global efficiency is 23.9% and 16.4%, respec¬ 
tively. In other words, from only a 1 % change in the choice of D c 
results in a 7.5% change in global efficiency reduction. Therefore, 


Table 2. Structural measures of simulated lesions for the top 
ten regions ranked according to percent reduction in local 
efficiency. 



Brain Region 

% Reduction 

Betweenness 


Local Eff. at 96 hrs 

at 0 hrs 

Lateral Orbitofrontal (RH) 

45.4 

0 

Parahippocampal (RH) 

44.2 

0 

Parsorbitalis (RH) 

44.1 

0 

Transverse Temporal (RH) 

42.0 

0 

Pericalcarine (RH) 

39.4 

1 

Parstriangularis (RH) 

39.3 

2 

Temporal Pole (RH) 

39.1 

3 

Rostral Anterior Cingulate (RH) 

38.4 

0 

Frontal Pole (RH) 

38.1 

12 

Superior Temporal (RH) 

37.9 

21 


RH and LH refer to right and left hemispheres, respectively. 
doi:10.1371/journal.pcbi.1002619.t002 
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the choice of D c will be important in obtaining accurate results 
and highlights the need for future experimentation to characterize 
cellular injury criteria for all areas of the brain in order to improve 
the accuracy of this modeling approach. 

Discussion 

For the first time, a physics-based model has been linked to a 
network-based analysis that establishes a coupled computational 
method to study the effects of localized structural damage or 
lesions. In the presented work, lesions are simulated by using a 
state-of-the-art finite element model of the human head, developed 
for a single individual directly from MRI, impacted on the 
forehead region. The local finite element variables are coupled to a 
network-based analysis through an empirically based cellular 
injury model. The new approach attempts to capture the 
spatiotemporal structural characteristics of brain trauma. Foun¬ 
dations of this idea stem from previous studies that attempted to 
establish relationships of lesion sites and the resulting functional 
impact [42,67]. 

The first part of our study involved developing a new three- 
dimensional human head finite element model for simulating the 
biomechanical response from frontal impact and comparing it to 
experimental data obtained from cadavers. The finite element 
model is unique in that it uses diffusion tensor imaging 
tractography to inform structural mechanics constitutive laws of 
the underlying white matter anisotropy in an effort to help 
elucidate the injury mechanisms of impact neurotrauma. Simula¬ 
tions of frontal impact capture the coup and contrecoup loading 
profiles, including short duration intracranial pressure gradients 
with high positive pressures at the coup region, and negative 
pressures at the contrecoup region (Figure 3a—b). The computed 
pressure compares well to past experimental results [53]. While the 
frontal region had the highest predicted pressure, the temporal 
and occipital brain locations had the largest values of axonal strain 
in the regions specifically examined and responded slower than the 
pressure response. Because of varied mechanical tissue properties 


and constraints with the head, translational cranial motion causes 
relative brain movements that happens after peak pressure, and 
leads to large brain deformations and significant localized regions 
of axonal strain (Figure 3e—f). 

Empirically based cellular death thresholds were used to predict 
the time-evolving damage in various brain regions based on finite 
element-based predictions of local axonal strain and strain rate. 
The biomechanical simulations predict that the temporal and 
occipital regions undergo the most axonal strain and strain rate at 
short times (less than 24 hrs), which leads to cellular death 
initiation that produces damage which shows dependence on angle 
of impact and underlying microstructure of brain tissue. The 
cellular death model that was used in this study is based on 
experimental observations that cell death was not immediate in 
response to deformation, but instead increased over four days after 
insult. Tissue damage becomes more dilute as time progresses 
(Figure 4g—h). At 96 hours, the predicted damaged axonal 
pathways can be seen in many directions and across all white 
matter brain regions, indicating diffuse structural degradation. 

Interestingly, when using injury criteria proposed in the past, 
including thresholds of axonal strain [8,30] or intracranial pressure 
[21], the finite element simulations predict injury within 10 ms of 
frontal impact. However, when using a cellular death criteria, 
damage takes longer to develop but offers a similar result to the 
resulting network strength (Figure 6). Thus, while the model may 
be limited in the most accurate description of cellular injury, it 
does seem to capture the underlying pathology of diffuse brain 
injury, i.e., widespread damage to axons in the brain, to some 
degree. While the two criteria offer similar prediction of network 
damage, cellular death, is perhaps, more useful because effects of 
strain, strain rate, and time have been decomposed into separate 
multiplicative terms that allow a compartmentalized study of 
extrinsic biomechanical conditions. On the other hand, as 
Morrison et al. [27] point out, purely mechanical definitions of 
injury thresholds may be too insensitive to identify the onset of 
injury for the brain, since biological tissues are alive and perform 
some form of active, physiological functions. For brain tissue, 
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Figure 8. Changes in (a) total edge strength and (b) global efficiency with change in critical cell death threshold for 24, 48, 72, and 
96 hours. 

doi:10.1371/journal.pcbi.l 002619.g008 
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failure can be defined in various ways and may occur far below 
mechanical failure limits. Therefore, additional tolerance criterion 
that capture degradation of the electrophysiological function may 
be also required for brain tissue, since injury mechanisms that may 
alter neuronal function without requiring cell death are observed 
and may become activated more quickly than cell death [68]. 

This work has attempted to establish a physics-based method¬ 
ology to inform structural connectome analysis. In the current 
model, network edges are degraded by weight rather than simply 
deleting nodes, in an attempt to include the effects of damage on 
white matter “fibers of passage” that Alstott et al. [42] refer to. It is 
assumed that tracks representing bundles of axons are able to be 
disconnected when passing through regions of high cell death, 
simulating reduction in the strength of a structural connection, and 
are related to a decrease in localized white matter integrity. All 
network damage was linked directly to predicted tissue deforma¬ 
tion and predicted cellular death. Interestingly, at 96 hrs post¬ 
impact, the methods used here did not lead to any node 
completely disconnected from the network (although there were 
edges completely disconnected). At early times (/<24hrs) network 
measures of global and local efficiency were degraded little, 
however, as time increased to 96 hrs the network properties were 
significantly reduced (Figures 6 and 7). Alstott et al. [42] found that 
random removal of nodes did not affect network integrity until 
almost all of the nodes were deleted. Thus, this method may 
capture some structural network features of very mild neuro¬ 
trauma at early times, but would benefit from a functional network 
analysis to explore the potential outcomes. As the network was 
damaged in certain areas, edges were lost or degraded, raising the 
cost to send information between nodes and producing different 
short paths between nodes. As a result, the brain network was not 
able to maintain efficiency by using alternate paths or finding 
strong hub connections past 24 hrs post-impact, thus demonstrat¬ 
ing a potential limitation of the brain to retain network robustness 
in extreme conditions that cause neurotrauma. 

While global efficiency is able to demonstrate widespread effects 
of damage, local efficiency provided a measure to investigate 
localized damage within the network related to areas of 
concentrated axonal strain and strain rate in particular areas of 
the brain. There was a much larger reduction in local efficiency at 
areas of high cell damage compared to reduction in the 
normalized global efficiency, indicating that the brain as a whole 
is resistant to some degree of localized damage. Brain regions that 
experienced the largest cellular death showed a larger reduction in 
local efficiency compared to the global efficiency of the network 
(Table 2). However, the low betweenness of many of these ROI 
suggest that they were not as necessary for communication outside 
their local area. This suggests that the modular nature of the 
network, including its small world properties, helped to prevent 
loss of efficiency on the global scale from damage at the local scale. 
This is interesting because the network considered here is a rigid, 
static, anatomic network in which there is no adaptability built into 
the model. However, it seems that damaged structural network 
hubs retain the ability for long distance communications. In 
addition, the increase predicted in the small-world coefficient also 
shows that modularity was not as affected by the damaged static 
network, as compared to a random network. 

Limitations and Future Work 

There are exciting possibilities for future work, as well as 
limitations to the current modeling approach. The current model 
did not attempt to model the coupled effects to the functional 
network; instead, we provided an example for a single individual in 
order to establish a methodology to link physics-based predictions 


of tissue damage with structural network analysis for frontal 
impact neurotrauma. It is important to note that the empirical 
relationships may only be accurate for the rat (not the human), and 
most likely, there are many more regions that need to be 
characterized. Our results and conclusions may be altered 
according to these injury thresholds. Although human injury 
thresholds are currently limited, as additional brain region injury 
thresholds are experimentally characterized and improved they 
can be included in the future. Due to the computational cost of the 
finite element simulations, the brain was only segmented into 83 
different regions. In the future, increased segmentation of regions 
of interest and improved biofidelity of the finite element model 
would increase the resolution of the analysis. While this study does 
not address the resulting functional outcome from structural 
degradation, coupled structure-function relationships as a result of 
neurotrauma would be interesting to explore. For example, a 
coupled analysis may enable functional stimulus that may prohibit 
or enhance further cell death. For the prediction of tissue damage, 
additional physics, such as electrochemical reactions, may be 
useful by incorporating diffusional properties. In addition, 
increased resolution of the biomechanical response may also be 
improved by further developing white matter material response 
descriptions that use multiple fiber tract orientations within a 
single element, thereby enabling the capability to use diffusion 
spectrum imaging. There is also an opportunity to use this 
framework to explore additional injury mechanisms or thresholds 
from empirical or experimental data, such as intracranial pressure. 
Note that the current methodology degrades network edges, 
instead of nodes. In the future, it may be useful to investigate 
methods for degrading nodes in addition to edges because the all 
of the nodes represent cortical gray matter, and the gray matter 
does experience significant strains. This would require a choice of 
strain measurement other than axonal strain since the gray matter 
is treated as isotropic. With this type of criteria, it may be possible 
to degrade a node based on a ratio of damaged voxels within a 
region of interest compared to the voxel volume of the region as a 
whole. 

Further work should also work to validate this approach in 
humans. There are at least two distinct areas associated with 
connectome damage that should be explored in order to validate 
the approach described herein. The first deals with how well the 
location of damage within the network description is captured 
using physics-based predictions. The second validation strategy 
should address the nature in which edges and nodes in the network 
are degraded. There are various approaches that may be useful for 
addressing both areas. For example, in order to validate how well 
the location of damage is captured, further understanding about 
how cellular level changes effect fractional anisotropy, that results 
in altered fiber tractography would be useful to develop. Some of 
this information may be obtained from various ongoing studies 
that are examining the ability of DTI to diagnose mTBI, which 
could also be extended to create degraded structural connectomes. 
One way to do this may be to use DTI studies pre- and post-injury 
from typical loading conditions that cause rotation-induced diffuse 
axonal injury. By providing similar loading profiles within the 
simulation and comparing computed DTI tractography damage 
with the clinical data set, maybe results can be compared. Perhaps 
this may be accomplished using sports-related impact injury, such 
as American football, where many helmets have sensors built-in to 
record impact loads. 

In conclusion, this work has explored “connectome neuro¬ 
trauma mechanics” by using physics-based finite element simula¬ 
tions to help elucidate injury mechanisms associated with 
neurotrauma by using various cellular injury thresholds to define 
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tissue damage, and established a coupled computational frame¬ 
work to inform structural connectome analysis. 

Supporting Information 

Figure SI The magnitude of peak relative displacement between 
skull and brain for validation of finite element model. 

(EPS) 

Figure S2 Percent reduction in local efficiency at 96 hrs for all 
brain regions. 

(TIF) 

Figure S3 Percent reduction in betweenness at 96 hrs for all 
brain regions. 

(TIF) 
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There are a substantial number of empirical relations that began with the identification 
of a pattern in data; were shown to have a terse power-law description; were interpreted 
using existing theory; reached the level of “law” and given a name; only to be subse¬ 
quently fade away when it proved impossible to connect the “law” with a larger body of 
theory and/or data. Various forms of allometry relations (ARs) have followed this path. 
The ARs in biology are nearly two hundred years old and those in ecology, geophysics, 
physiology and other areas of investigation are not that much younger. In general if X 
is a measure of the size of a complex host network and Y is a property of a complex 
subnetwork embedded within the host network a theoretical AR exists between the two 
when Y = aX b . We emphasize that the reductionistic models of AR interpret X and 
Y as dynamic variables, albeit the ARs themselves are explicitly time independent even 
though in some cases the parameter values change over time. On the other hand, the 
phenomenological models of AR are based on the statistical analysis of data and inter¬ 
pret X and Y as averages to yield the empirical AR: (Y) = a(X) b . Modern explanations 
of AR begin with the application of fractal geometry and fractal statistics to scaling phe¬ 
nomena. The detailed application of fractal geometry to the explanation of theoretical 
ARs in living networks is slightly more than a decade old and although well received it 
has not been universally accepted. An alternate perspective is given by the empirical AR 
that is derived using linear regression analysis of fluctuating data sets. We emphasize 
that the theoretical and empirical ARs are not the same and review theories “explain¬ 
ing” AR from both the reductionist and statistical fractal perspectives. The probability 
calculus is used to systematically incorporate both views into a single modeling strategy. 
We conclude that the empirical AR is entailed by the scaling behavior of the probability 
density, which is derived using the probability calculus. 

Keywords : Allometry; fractals; physiology; scaling; statistical analysis; ontogenetic 
growth. 
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1. Introduction 

At the turn of the nineteenth century the German polymath Gauss 64 and the Amer¬ 
ican mathematician Adrian 1 introduced into science the law of frequency of errors, 
the French physicist Laplace 116 proved the central limit theorem and the French 
zoologist Curvier 34 determined that the brain mass of a mammal increases more 
slowly than does its total body mass (TBM). Cuvier’s observations initiated the 
field of Allometry that relates the size of an organ to that of the host organism with 
mass serving as a measure of size. The roads of statistical analysis and observation 
(experiment) converge in the twenty-first century to explain the origin of allometry 
relations (ARs). 

Allometry, literally meaning by a different measure, has acquired a mathematical 
description through its relations along with a number of theoretical interpretations 
to account for its mathematical form. However no one theory has been universally 
accepted as successfully explaining ARs in their many guises so the corresponding 
origins remain controversial. Consequently, in reviewing the properties of allometry 
data along with their various theoretical explanations we herein provide a glimpse 
as to those origins. 

We use the generic term network in our narration in order to slip smoothly from 
ARs in physical networks with identical particles and van de Waal’s forces, to bio¬ 
logical networks with structured elements and chemical interactions, to geophysical 
networks with complex tributaries and branching architectures. This nomenclature 
also enables us to transition from arcane historical theory to a modern perspectives 
of complex networks. The mathematics of renormalization group (RG) theory, 103,259 
fractional differential equations, 126,144 fractional stochastic differential equations 236 
and transitioning from dynamic variables to phase space variables to express the 
probability calculus in terms of fractional diffusion equations 108,207,236 are herein 
found to provide insight into different aspects of the origins of ARs. 

Allometry has been defined as the study of body size and its consequences 79,176 
both within a given organism and between species in a given taxon. Gayon 65 re¬ 
viewed the history of the concept of allometry, defined as the study of body size 
and its consequences 79,176 within a given organism and between species in a given 
taxon, and distinguished between four different forms: (1) ontogenetic allometry, 
which refers to relative growth in individuals; (2) phylogenetic allometry, which 
refers to constant differential growth ratios in lineages; (3) intraspecies allometry, 
which refers to adult individuals within a species; (4) interspecies allometry, which 
refers to the same kind of phenomenon among related species. The theoretical 
entailment of static from dynamic allometry models has not been systematically 
studied, although there has been some recent effort in that direction. 67,243 Herein 
we review the use of phenomenological dynamic equations from physical biology to 
relate the dynamic allometry of the first two categories to their static counterparts 
in the last two. 

Galileo 62 recognized that in order for an organism or physical structure to retain 
a constant function as size increases requires its shape (architecture) and/or the 
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materials with which it is constructed to change. This simple physical picture has 
biological and social analogs in which the size of a network has unavoidable con¬ 
sequences . 164,213 We focus on ARs for physiologic phenomena, but we do indicate 
where insight might also be gained from other disciplines as well. 

1.1. What is the equation? 

Sir Julian Huxley 98 ; grandson of the Huxley of Darwin evolution fame, brother of 
the novelist Aldous (Brave New World) and half-brother of the biophysicist Andrew 
(the Hodgkin-Huxley equations) 209 ; proposed that two parts of the same organism 
have proportional rates of growth. In this way if Y is a living subnetwork observable 
with growth rate d and X is a measure of the size of a living host network with 
growth rate 7 then the fractional increase in the two is denoted according to Huxley 
by: 

dX/^X = dY/dY . (1) 

This equation can be directly integrated to obtain: 

Y = aX b , ( 2 ) 

the time-independent AR where a and b (= 7 /d) are empirically determined. The 
theoretical AR given by Eq. (2) considered by Huxley is the basis of subsequent 
theoretical discussions in such excellent books as Schmidt-Nielson 195 and Calder . 31 

The intraspecies AR relates a property of an organism within a species to its 
TBM. The interspecies AR relates a property across species such as the basal 
metabolic rate (BMR) to TBM . 31,195 These two allometry groups are distinctly 
different and the models developed to determine the theoretical forms of the al¬ 
lometry coefficient a and exponent b in the two cases are quite varied, as shown 
subsequently. 

Equation (2) looks very much like the scaling relations that have become so 
popular in the study of complex networks over the last decade . 3,30,153,221,239 His¬ 
torically the nonlinear nature of Eq. (2) has precluded the direct fitting of the 
equation to data. A logarithmic transformation is traditionally made and a linear 
regression to the data on the equation 

In Y = In a + b In X (3) 

is used to estimate the parameters a and b. In Sec. 2 we review a myriad of phe¬ 
nomena from a number of disciplines in which ARs have been brought to light. In 
Sec. 3 we discuss the fitting of ARs to data. 

All complex dynamical networks manifest fluctuations, either due to intrinsic 
nonlinear dynamics producing chaos 124,155 or due to coupling of the network to an 
infinite dimensional, albeit unknown environment , 118 or both. The modeling strate¬ 
gies adopted to explain ARs have traditionally taken one of two roads: the statistical 
approach in which residual analysis is used to understand statistical patterns and 
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to identify the causes of variation in the AR; 31,195 or the reductionist approach to 
identify mechanisms that explain specific values of the allometry parameters. 10,247 
We find that neither approach separately provides a complete explanation of the 
variety of phenomena described by ARs. The influence of the environment, whether 
inducing fluctuations in a reductionist model, or producing a systematic change in a 
statistical model, has been taken into account in multiple studies. 71,73,141 In Sec. 5 
we take the road of the probability calculus that systematically incorporates both 
reductionistic and statistical mechanism into the phenomenological explanation of 
ARs. This calculus enables modelers to associate characteristics of the measured 
probability density function (pdf) with specific deterministic mechanisms and with 
structural properties of the coupling between variables and fluctuations. 118,175 

1.2. Is there universality? 

In physics the notion of universality occurs in the context of critical phenomena, 
where in the vicinity of a phase transition a network ceases to have a characteristic 
scale and can be characterized by a set of critical exponents. The principle of 
universality claims that networks that undergo a phase transition can be grouped 
into one of a small number of universality classes. Gisiger 68 discusses universality in 
the context of invariance in biology; specifically addressing the role of 1// noise. In 
the context of AR universality might imply that the allometry exponent plays the 
role of a critical exponent. The empirical evidence suggests that this interpretation 
of the allometry exponent is not appropriate. A somewhat less stringent definition 
of universality may be defensible; one in which the details of the interactions are 
washed out but the form of the AR persists with the allometry exponent in a 
restricted interval. 

In biology the ARs associate functional variables with measures of body size, 
such as the TBM X = M raised to a noninteger power M b . The average BMR is one 
such functional variable Y = B that can be expressed in terms of TBM. The most 
prevalent theories of metabolic allometry argue for either 6 = 2/3, based on body 
cooling, or b = 3/4, based on energy efficiency. Selected data sets have been used 
by various investigators to support either of these two values. However, there is also 
strong evidence that there is no universal value of b that is satisfied by all metabolic 
data. In fact, Bokma 21 presents a large amount of fish data to demonstrate that no 
single universal value of the allometry exponent b exists. This argument is supported 
by Glazier 72 who observes that the isometric (6=1) metabolic scaling of pelagic 
animals is an evolutionarily malleable trait that responds to environmental changes. 

On the other hand, West and Brown 252 argue that living networks do have uni¬ 
versal scaling laws. Their arguments rest on patterns observed in biological and 
botanical data and the quantitative theory of the fractal structure, organization 
and dynamics of the branching networks in living systems. The theory developed 
by West et al, 247 which we review in Sec. 4, has as one of its tenets the existence of 
hierarchical fractal-like branching networks for the delivery of nutrients resulting in 
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b = 3/4. They attribute this origin of metabolic AR to evolution’s solution to the 
grand challenge of how highly complex, self-sustaining, reproducing, living networks 
service enormous numbers of localized microscopic units in an efficient and “demo¬ 
cratic” way. Their conclusion, like that of the earlier analysis others, 230,231 was 
that fractal networks have an evolutionary advantage over those that scale classi¬ 
cally, 213 independently of what the networks distribute from macroscopic reservoirs 
to microscopic sites. 

Scaling is a ubiquitous property of large complex networks indicating that the 
observables simultaneously fluctuate over many time and/or space scales. In the 
physical sciences such phenomena have historically been categorized as 1// noise 225 
or 1// variability. 239 Mandelbrot 131 was probably the first to recognize the wide- 
ranging significance of this 1// variability with his introduction of fractals into the 
scientist’s lexicon. The existence of ARs has been closely tied to fractal geometry 
by some investigators 247 and in Sec. 5 we show that the origin of AR may reside in 
fractal statistics, the scaling of pdfs , and not in fractal geometry. 


1.3. Some comments on fractals 

Mandelbrot 131,132 identified ARs masquerading under a variety of empirical “laws” 
and argued that they were a consequence of complex phenomena not having char¬ 
acteristic scales. Subsequent interpretations of ARs often involve fractals and so 
we recall some fundamental properties of fractals that are subsequently useful. To 
begin let us give a qualitative definition of a fractal: 133 “A fractal is a shape made 
of parts similar to the whole in some way.” 

The fractal concept arises in three distinct, but related guises; geometry, statis¬ 
tics and dynamics. Geometric fractals deal with the self-similarity of complex geo¬ 
metric forms. A fractal object examined with ever increasing magnification reveals 
ever greater levels of detail; detail that is self-similar in character. The basic math¬ 
ematical properties of geometric fractals and their myriad of applications can be 
found in a number of excellent books 17,55,57,131,133,143,238 and is not repeated here. 
We merely record and interpret those properties that may be needed for the anal¬ 
yses of ARs. The number of self-similar objects N required to cover an object of 
dimension D is given by N = r -D , where r is the size of the “ruler”. In this way 
the fractal dimension of the object being covered can be mathematically defined as: 

D = — hi N/ In r (4) 

in the limit of vanishing r. As the ruler size goes to zero the number of rulers 
necessary to cover the object diverges to infinity in such a way that D remains 
finite for self-similar objects and this dimension is not necessarily integer valued. 

An observable Z(t) is scaling if for a positive constant c it satisfies the homo¬ 
geneity relation 


Z(ct) = c H Z(t). 


( 5 ) 


121 


On Allometry Relations 

Modifying the units of the independent variable therefore only changes the overall 
observable by a multiplicative factor; this is self-affinity. Barenblatt 13 remarked 
that such scaling laws are not merely special cases of more general relations; they 
never appear by accident and they always reveal self-similarity. Note that scaling 
alone is not sufficient to prove that a function is fractal, but if a function is fractal 
it does scale. 

In the sequel we relax the distinction between self-affine and self-similar, since 
self-similarity has been extended to encompass both meanings in the physics liter¬ 
ature. Meakin 143 asserts that in homogeneous scaling relations it is the coefficient 
that embodies the “real physics” behind power-law relations. He further observed 
that the allometry exponents are universal in many homogeneous scaling phenom¬ 
ena and the allometry coefficients provide the only means to control physical prop¬ 
erties and behavior. 

Scale invariance or scaling requires that a function <h(Xi,..., Xn) be such that 
scaling each of the N variables by an appropriate choice of exponents (aq,..., o/v) 
always recovers the same function T(Ai,..., Xn) up to an overall constant: 

$(W, ...,X N )= AHl ai Xi,j aN X N ). (6) 

We observe that Eq. (2) is possibly the simplest of such scaling relations between 
two variables such that they satisfy the RG relation 

X(jY) = 7 b X(Y). 

The lowest-order solution to this equation is, of course, given by Eq. (2) and we 
provide the general solution subsequently. Changes in the host network X (size) 
control (regulate) changes in the subnetwork Y (property) in living networks and 
in some physical networks through the homogeneous scaling relation. 

Inhomogeneity in space and intermittency in time are the hallmarks of fractal 
statistics and it is the statistical rather than the geometrical sameness that is ev¬ 
ident at increasing levels of magnification. In geometrical fractals the observable 
scales from one level to the next. In statistical fractals where the phase space vari¬ 
ables (z,t) replaces the dynamic variable Z(t) it is the pdf P(z,t) that satisfies a 
scaling relation: 


P(az,fit)=fi M P(z,t); /i = ln<a/ Infi , 



where r = 2 H and the homogeneity relation is interpreted in the sense of the pdf in 
Eq. (7). Time series with such statistical properties are found in multiple disciplines 
including finance, 134 economics, 135 neuroscience, 4,226 geophysics, 215 physiology 240 
and general complex networks. 244 A complete discussion of statisical data with such 
scaling behavior is given by Beran 20 in terms of the long-term memory captured 
by the scaling exponent. One example of a scaling pdf is given by: 




z 

tv 


( 8 ) 
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and in a standard diffusion process Z(t) is the displacement of the diffusing particle 
from its initial position at time t, (i = 1/2 and the functional form of F z (-) is a 
Gauss distribution. However, for general complex phenomena there is a broad class 
of distributions for which the functional form of F z (-) is not Gaussian and the 
scaling index fi ^ 1 / 2 , see Sec. 5 for additional discussion. 

Dynamic fractals do not directly enter our discussion of ARs. However for com¬ 
pleteness we mention that in a dynamic fractal the geometry of the manifold on 
which the dynamics of a network unfolds is fractal, so that the associated chaotic 
time series is also fractal . 155 

2. Empirical Allometry 

In this section we review areas of investigation where size has been observed to 
control the properties of a phenomenon. In so doing we catalog a number of phe¬ 
nomenological relations that are not often discussed from the same perspective. 
There is a nontrivial number of empirical relations that began as the identification 
of a pattern in data; were shown to have a terse power-law description; were in¬ 
terpreted using existing theory; reached the level of “law” and given a name, not 
always after the discoverer; only to subsequently fade away when it proved impos¬ 
sible to connect the “law” with a larger body of theory and/or data. An example 
drawn from the Notebooks of Leonardo da Vinci 177 relates the diameter of a parent 
limb do to two daughter limbs d\ and cfe : 

d% = d? + d% . (9) 

The da Vinci scaling relation supplies the phenomenological mechanism necessary 
for AR to emerge in a number of disciplines, as we subsequently discuss. 

Nearly five hundred years after de Vinci recorded his observations Murray 150 
used energy minimization to derive the same equation with the theoretical value 
a = 3, which is known in the literature as Murray’s Law or the Murray-Hess Law. 
In the simplest case d\ = cfe the da Vinci scaling relation reduces to scaling between 
sequential generations of a bifurcating branching network having daughter branches 
of equal radii: 

4+i = 2 _1/a 4 (10) 

resulting in an exponential reduction in branch diameter from generation to 
generation. 


2.1. Living networks 

Living networks have static intraspecies ARs that link two distinct but interacting 
parts of the same organism in terms of mass, with mass serving as a measure of 
size. Smith 202 maintained that concentrating on a power function as the method 
for evaluating the biological consequences of size has masked the complexity ofthe 
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allometry problem. We agree with this observation, but perhaps in ways that Smith 
would not have anticipated, as will become evident. 


2.1.1. Biology 

Cuvier 34 was the first to recognize that brain mass increases more slowly than 
TBM as we proceed from small to large species within a taxon. This empirical 
observation was subsequently made between many other biological observables and 
was first expressed mathematically as an allometric relation by Snell 204 : 

brainweight = a(bodyweight) 6 , (11) 

where on log-log graph paper a is the intercept with the vertical axis and b is the 
slope of the line segment. Mammalian neocortical quantities Y have subsequently 
been empirically determined to change as a function of neocortical grey matter 
volume X as an AR. The neocortical allometry exponent was first measured by 
Tower 214 for neuron density to be approximately —1/3. The total surface area of 
the mammalian brain was found to have an allometry exponent of approximately 
8/9. 94,100,167 Changizi 33 points out that the neocortex undergoes a complex trans¬ 
formation covering the five orders of magnitude from mouse to whale depicted in 
Fig. 1 but the ARs persist; those mentioned here along with many others. 



Fig. 1. Mouse to elephant curve. BMR of mammals and birds are plotted versus TBM on log-log 
graph paper. The solid line sement is the best linear regression of Eq. (2) to the data with a slope 
very close to 3/4. [reproduced from Schmidt—Nielsen 195 with permission]. 
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Another quantity of interest is the time; not the chronological time measured by 
a clock but the intrinsic time of a biological process first called biological time by 
Hill. 93 Hill reasoned that since so many properties of an organism change with size 
that time itself may scale with TBM. Lindstedt and Calder 119 develop this concept 
further and determine experimentally that biological time, such as species longevity, 
satisfies an AR with Y being the biological time. Lindstedt et al. 121 clarify that 
biological time r is an internal mass-dependent time scale 

r = aM b (12) 

to which the duration of biological events are entrained. They present a partial list 
of such events that includes breath time, time between heart beats, blood circulation 
time and time to reach sexual maturity. In all these examples and many others the 
allometry exponent clusters around the theoretical value b = 1/4. Note that the 
total energy of an organism seen as a bioreactor is proportional to volume (M) and 
the biological time is proportional to M 1 / 4 , so the metabolic rate (energy/time) 
would scale as M 3 / 4 . 


2.1.2. Botany 

Niklas 154 shows in Fig. 2 an impressive statistical trend spanning twenty orders of 
magnitude in the mass of aquatic and terrestrial nonvascular and vascular plant 
species. The annual growth in plant body biomass Gt (net annual gain in dry 
mass per individual) and Mt (total dry mass per individual) are related by the 



Fig. 2. Log-log bivariate plot of total annual growth rate in dry body mass per individual Gt 
versus TBM. Line segment denotes reduced major axis regression curves for the entire data set. 
[adapted from Niklas 154 with permission]. 
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empirical AR: 


G t oc Mp . (13) 

Figure 2 shows linear regression on logarithmically transformed data 

log Gt = log a T 0.75 log Mt (14) 

and the parameter a is the intercept with the vertical axis. In the data analyses 
recorded herein the terms weight, mass and volume are used almost interchange¬ 
ably as measures of size. The allometry exponent is 3/4 for the data in Fig. 2 but 
empirically differs from this value when the data sets are graphed individually. The 
allometry coefficients of the separate data sets may vary as a function of habi¬ 
tat as well. The agreement between the biomass data and the AR with exponent 
3/4 is very suggestive but it must be viewed critically because of methodological 
limitations. 

Reich et a/. 174 analyzed data for approximately 500 observations of 43 perennial 
plant species of coupled measurements of whole-plant dry mass and Gt from four 
separate studies. Collectively, the observations span five of the approximately 12 
orders of magnitude of size in vascular plants. 53 The result of each experiment 
yielded an isometric scaling of b ~ 1 and not b = 3/4 as did the scaling of Gt to 
TBM for whole plants. Consequently, even when data look as appealing as they do 
in Fig. 2 things are not always what they seem. 


2.1.3. Clearance curves 

Another allometry phenomenon is the dependence of drug-dosing range on TBM 
and is referred to as clearance. 97 Zenobiotic clearance is the rate at which any foreign 
compound not produced by an organism’s metabolism is passed from the organism. 
The application of allometry ideas to pharmacokinetics and to determining human 
parameters from those in animals is fairly recent. 22,127,191 The early studies did 
not address questions of variability in the allometry parameters and were primarily 
concerned with whether the allometry exponent more closely tracked the value 2/3 
or 3/4 by doing linear regression on log-transformed data. 122 

Hu and Hayton 97 addressed the possible impact of statistical variability in the 
AR parameters on the predicted pharmacokinetic parameter values. They found 
considerable uncertainty in the value of the allometry exponent, which they fit to a 
Gaussian distribution with mean value 0.74. Even though they could not determine 
whether the variability in the allometry exponent was due to experimental error 
or to biological mechanisms they did find that there was no systematic deviation 
from the AR. However it appears that whether b = 3/4 or 2/3 depends on which 
of the major elimination pathways is used, metabolism for the 3/4 value and renal 
excretion for the 2/3 value. 
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2.1.4. Physiology 

The most studied of the interspecific ARs does not concern relative growth but is 
that associating the average BMR measured in watts to average TBM measured in 
kilograms of multiple species such that: 

B = aM b . (15) 

The metabolic rate refers to the total utilization of chemical energy for the genera¬ 
tion of heat by the body of an animal and is often measured by the oxygen intake 
during respiration. 

The earliest physiologic model for the value of the allometry exponent in the 
intraspecies AR was given by Sarrus and Rameaux. 188 Schmidt-Nielsen 195 records 
that this team of a mathematician and a physician reasoned that the heat gener¬ 
ated by a warm blooded animal is proportional to the volume and the heat loss 
is proportional to the animal’s free surface. Experiments on dogs by Rubner 183 
supported their argument and lead to the wide acceptance of the “surface law” in 
which b = 2/3. In Fig. 1 the “mouse-to-elephant” curve depicts the BMR for mam¬ 
mals and birds plotted versus TBM on log-log graph paper spanning six orders of 
magnitude. The solid line segment is the fit of the AR to the data and yields the 
empirical value b ~ 3/4. 

As Niklas 154 noted the expectation was that the metabolic rate would be pro¬ 
portional to the 2/3 power of the TBM as prescribed by the surface law. Surpris¬ 
ingly, this turned out not to be the case. The research of Kleiber 109 and Brody 24 
revealed that the slope was closer to 3/4 than to 2/3. Subsequent observational 
studies have reinforced the allometric pattern observed in the data predicted by 
Eq. (15) including some relating the 3/4-rule to plants, see, for example, Hem- 
mingsen. 86 Consequently the phenomenological value of the allometry exponent b 
remains controversial. 47,71,91 

Controversy also persists regarding the theoretical explanation as to why the 
allometry exponent b should have a specific value. The simple geometrical argument 
of Sarrus and Rameaux suggests b = 2/3 as reviewed in a number of excellent 
sources. 31,47,91 On the other hand, the quarter-power AR is explained by West, 
Brown and Enquist (WBE) using geometric scaling arguments from fractal physics 
to establish the value b = 3/4 and other quarter-power scaling laws in physiology, 
see Sec. 4. 

Heusner 91 adopted geometric scaling arguments to obtain b = 2/3 in the AR 
between BMR and TBM. He argued that the various other values experimentally 
observed for the power-law index by investigators are a consequence of differing 
values of the allometric coefficient a. He reasoned that two or more data sets with 
b = 2/3 but with different values of a graph as parallel line segments on log-log 
graph paper, but when the two or more data sets are grouped together and analyzed 
as a single data set the aggregate is fit by a single line segment with net slope 
b > 2/3. The same argument can be found in a number of other references. 176,195 
But unlike those earlier references Heusner 91 concluded that it is the allometry 
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coefficient that remains the central mystery of allometry and not the allometry 
exponent. We investigate the implications of Heusner’s conjecture in Sec. 3 where 
we numerically explore the implications of treating the allometry coefficient as a 
random variable. 

It is useful to list the various forms of physiological allometry given by Lindstedt 
and Schaeffer 122 : pulmonary and cardiac allometry with b = —1/4; renal allometry 
with b = —0.85; liver allometry with b = —0.85; pulmonary blood volume with 
b = 1.0; cardiac output with 6 = 3/4 and pulmonary transit times with b = 1/4. 


2.1.5. Information transfer 

There are literally dozens of physiologic ARs for physiologic time r, relative to clock 
time t, that increases with increasing body size r = aM b120 and describes chemical 
processes such as the turnover time for glucose with b = 1/4 9 to the life span of 
various animals in captivity with b = 0.20. 185 Schmidt-Nielsen 195 explains how a 
variety of physiologic time scales such as the length of a heart beat and respiration 
all scale with body size and from that deduce a number of interesting relations. 
It is only recently however that Hempleman et al 88 hypothesized a mechanism to 
explain how information about the size of an organism is communicated to the 
organs within the organism. Their hypothesis involved matching the neural spike 
code to body size to convey this information. 

Hempleman et al. 88 suggest that mass-dependent scaling of neural coding may 
be necessary for preserving information transmission with decreasing body size. 
They point out that action potential spike trains are the mechanisms for long 
distance information transmission in the nervous system. They go on to say that 
neural information may be “rate coded” with average spike rate over a time period 
encoding stimulus intensity or “time coded” with the occurrence of a single spike 
encoding the occurrence of a rapid stimulus transition. The hypothesis is that some 
phasic physiological traits are sufficiently slow in large animals to be neural rate 
coded, but are rapid enough in small animals to require neural time coding. These 
trait include such activities as breathing rates that scale with b = —1/4. 

They tested for this allometry scaling of neural coding by measuring action po¬ 
tential spike trains from sensory neurons that detect lung CO 2 oscillations linked to 
breathing rate in birds ranging in body mass from 0.045 to 5.23 kg. While it is well 
known that spike rate codes occur in the sensing of low frequency signals and spike 
timing codes occur in the sensing of high frequency signals, their experiment was 
the first designed to test the transition between these two coding schemes in a single 
sensory network due to variation in body mass. The results of their experiments 
on breathing rate was an allometry exponent in the interval —0.26 < b < —0.23 
and although taken on a small number of birds their results do suggest a preserva¬ 
tion of information transmission rates for high frequency signals in intrapulmonary 
chemoreceptors and perhaps other sensory neurons as well. The implications of 
these experiments strongly suggest the need to continue such investigations. 
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On the more theoretical side Moses et al . 149 apply the scaling ideas of metabolic 
allometry developed by West et al. 247 to information networks consisting of micro¬ 
processors to form a network. Moses et al . 149 use a fractal argument to construct a 
two-dimensional hierarchal self-similar branching network, an H-tree that Mandel¬ 
brot 131 originally used in his discussion of the space filling behavior of the human 
lung. They show that this branching network of microprocessors have a striking 
similarity to such networks in organisms even though the latter has evolved by 
natural selection and the former are designed by engineers. 

Along this same line E. F. Rent, while an IBM employee in the 1960s, wrote a 
number of internal memos (unpublished) relating the number of pins at the bound¬ 
aries of an integrated circuit (A) to the number of internal components (Y~), such 
as logic gates, to obtain an AR with b < 1.0. This rule has historically been used by 
engineers to estimate power dissipation in interconnections and for the placement 
of components in very large scale integrated (VLSI) circuit design. More recently 
Rent’s Rule has been used to model information processing networks in the human 
brain 16 where the mass of grey and white matter are shown to satisfy an AR as first 
noted by Schlenska . 194 Beiu and Ibrahim 18 suggested that the allometry exponent 
for grey and white matter between species is identical to the Rent exponent within 
a species and this was supported using MRI data by Bassett et al. 16 


2.2. Physical networks 

Some of the oldest ARs involve physical networks, or more specifically geophys¬ 
ical networks. The skeptic need only return to da Vinci’s scaling relation. In his 
notebooks da Vinci explains the meaning of this equation not only in the context 
of relating tree trunks to subsequent branches, but to the branchings of rivers as 
well. Long before the conservation of energy and the continuity of fluid flow were 
known to scientists, the enigmatic Italian painter, sculptor, military engineer and 
anatomist understood the basics of hydrologic networks. 

2 . 2 . 1 . Geology and geomorphology 

Horton’s law of river numbers is another empirical regularity observed in the topol¬ 
ogy of river networks . 95 As observed by Scheidegger 192 the number of river segments 
in successive order form a geometrical sequence such that, the 

number of rivers with k tributaries oc R\- k . (16) 

The bifurcation parameter R 5 is the constant ratio between successive numbers of 
river networks, known as Horton’s law of stream numbers nk/nk+i = Rb and has 
the empirical value between 4.1 and 4.7 in natural river networks , 159,160 in contrast 
to the random model that predicts a value of four. Note that the system of counting 
begins at the smallest tributary that are the most numerous Rb > 1 and these feed 
into larger tributaries that are fewer in number. 
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Dodds and Rothman 46 point out that universality arises when the qualitative 
character of a network is sufficient to quantify its essential features, such as the 
exponents that characterize scaling laws. They go on to say that scaling and uni¬ 
versality have found application in the geometry of river networks and the statistical 
structure of topography within geomorphology. They maintain that the source of 
scaling in river networks and whether or not such scaling belongs to a single uni¬ 
versality class is not yet known. They do provide a critical analysis of Hack’s law, 
see also, Rodriguez-Iturbe and Rinaldo. 182 


2.2.2. Hydrology 

Hack’s law is a hydrologic AR having to do with the drainage basins of rivers. Hack 83 
developed an empirical relation between mainstream length of a river network and 
the drainage basin area at the closure of the river. Hack’s law is given by, 

mainstreamlength oc (area )\ (17) 

where h is the Hack exponent with the typical empirical value h « 0.57. Hack 
asserted that river networks are not self-similar. 

Mandelbrot 131 relates Hack’s exponent to the fractal dimension of the river net¬ 
work and debates the interpretation of the equation. Feder 57 observed that defining 
a fractal dimension for river networks was obscure and required further study. A 
modern version of this discussion in terms of hydrologic allometry is given by Ri¬ 
naldo et al. 181 who point out that optimal channel networks yield h = 0.57 ± 0.02 
suggesting that feasible optimality 178 implies Hack’s law. Another viable model is 
given by Sagar and Tein 186 that is geomorphology realistic giving rise to general 
ARs in terms of river basin areas, as well as parallel and perpendicular channel 
lengths. 

Maritan et al. 136 consider an analogy with the metabolic AR using M oc B a , 
that is, a = 1/6 so that a = 1 + h with the limiting values a = 3/2 and h = 1/2 
in the case of geometric self-similarity. Geometric self-similarity is the preservation 
of the river’s shape as the basin increases in area. The observed values he in the 
range 1.5 < a < 1.6 and the scatter of individual curves (analog of intraspecies 
data) is remarkably small. These values suggest that the branching nature of rivers 
is fractal, that is, a > 3/2 in most cases. The ensemble average of Hack’s exponents 
from different basins extend over 11 orders of magnitude and is indistinguishable 
from h = 1/2. 146 Maritan et al 136 conclude that like the interspecies metabolic rate, 
the slope of the intraspecies h’s are washed out in the ensemble average, resulting 
in the value h = 1/2. 

2 . 3 . Natural History 

Natural History embraces the study, description and classification of the growth 
and development of natural phenomena. The focus of investigation includes such 
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important contemporary areas as ecology and paleontology, parts of which rely 
heavily on AR and scaling. 


2.3.1. Ecology 

Ecology is the scientific study of the distribution, abundance and relations of or¬ 
ganisms and their interactions with the environment. Such living networks include 
both plant and animal populations and communities along with the network of re¬ 
lations among organisms of different scales of organization. Of concern to us here 
are the species traits determined by body size and how these in turn affect food 
web stability. Woodward et a/. 261 along with many others point out that the largest 
metazoans, for example, whales (10 8 grams) and giant sequoias, weigh over 21 or¬ 
ders of magnitude more than the smallest microbes (10 -13 grams). 102,234 They go 
on to stress the considerable variation in body mass among members of the same 
food web. 

We do not address the traditional linking of species through such mechanisms as 
the prey-predator relation within food webs 76 but instead we focus on body size. 
The significance of body size has been systematically studied in ecology. 27,35,102 
Indentifying X with species abundance and Y with TBM in Eq. (2) there is, in 
fact, an AR between the species at the base of a food web and the largest predator 
at the top. 35 We note that species-area power functions have a vital history in 
ecology 166,258 even though the domain of sizes over which the power law appears 
valid is controversial. 25,256 

Woodward et al 261 emphasize that AR has been used to explain the observed 
relations between body size and each of: home range size, nutrient cycling rates, 
numerical abundance and biomass production. They speculate that body size may 
capture a suite of covarying species traits into a single dimension, without the 
necessity of having to observe the traits directly. 

Brown et al 26 discuss the universality of the documented ARs in plants, ani¬ 
mals and microbes; to terrestrial, marine and freshwater habitats; and to human- 
dominated as well as “natural” ecosystems. They emphasize that the observed 
self-similarity is a consequence of a few basic physical, biological and mathematical 
principles; one of the most fundamental being the extreme variability of the data. 
The variety of distributions of allometry coefficients and exponents are discussed 
both phenomenologically and theoretically in subsequent sections. 

Farr’s law 29 is an example of the change in ARs in the transition from organ- 
ismic to environmental allometry. Farr collected data on the number of patients 
committed because of their mental condition and their mortality from a variety of 
asylums in 1830s England. 56 From these humble beginnings he was able to sum¬ 
marize the “evil effect of crowding” into a relation between mortality rate R and 
population density p 96 : 


R = ap b . 


( 18 ) 
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Here we see that the size measure used in the metabolic AR, the mass, is replaced 
with a measure of community structure, the population density. The ARs that cap¬ 
ture life histories in ecology and sociology are often expressed in terms of numbers 
of animals and areas in addition to body mass. Calder 31 points out that size and 
time seem to be the principle characteristics of life history and ecology. 

The factors necessary for the formation of social groups with a restricted 
geographic area are not understood, but certain ARs help clarify them. The popu¬ 
lation density p of herbivorous mammals has been determined to be related to their 
mass M by 39 p~ l = cM 0-75 where the animals freely roam on a “home range” of 
given area: A h r = dM im . A similar relation exists for carnivores. 145 Consequently, 
as explained by Calder, 31 herbivorous mammals above a certain size range over an 
area greater than their per capita share of the local habitat. The degree of overlap 
was empirically determined to be 40 A^p = c"M ° -34 , where the empirical exponent 
0.34T0.11 is statistically indistinguishable from that obtained by combining the sep¬ 
arate exponents for the population density and area, that being, 1.02 — 0.75 = 0.27. 
Calder conjectures that the greater the product A^p the greater the intensity of 
competition and the greater the desirability of social networks that contribute to 
mutual tolerance within these groups. Makarieva et al 129 argue that animal home 
range represents a biological footprint of the undisturbed state of an econetwork, 
however the population density adapts to disturbances in the econetwork. Conse¬ 
quently, the deviation of the home range-population product from isometry reflects 
the degree of econetwork disturbance. 

The AR between maximum abundance and body size for terrestrial plants N oc 
M~ b was extended by Belgrano et al. 19 to the maximum population densities of 
marine phytoplankton with b = 3/4. They draw the implication that maximum 
plant abundance is constrained by rates of energy supply in both terrestrial and 
marine networks as dictated by a common AR. Earlier investigators found b > 
3/4. 37,49 


2.3.2. Zoology and acoustics 

Mice squeak, birds chirp and elephants trumpet due to scaling. Fitch 59 discusses the 
relationship between an organism’s body size and acoustic characterization of its 
vocalization under the rubric of acoustic allometry. Data indicate an AR between 
palate length (the skeletal proxy for vocal tract length) and body mass for a variety 
of mammalian species. He shows that the interspecies allometry exponent attains 
the geometric value of three in the regression of skull length and body mass, whereas 
the intraspecies allometry exponent varies a great deal. The significant variability 
in the intraspecies allometry exponent suggest taxon-specific factors influencing the 
AR. 137,199 

Fitch 59 gives the parsimonious interpretation that the variability in the in¬ 
traspecies allometry exponent could be the result of each species adopting allomet- 
ric scaling during growth as postulated by Huxley, with a different proportionality 
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factor for each species. On the other hand, the interspecies allometry exponent 
could result from the common geometric constraints across species due to the wide 
range of body sizes. He concludes that the AR between vocal tract dimensions 
and body size could provide accurate information about a vocalizer’s size in many 
mammals. 

2.3.3. Paleontology 

Pilbeam and Gould 164 provide reasons as to why body size has played such a 
significant role in biological macroevolution. The first is the statistical generalization 
known as Cope’s Law or Rule, which states that population lineages increase in 
body size over evolutionary time scales, 36 that is, the body size of a species is an 
indication of how long it has survived on geological time scales. A second reason is 
the one mentioned earlier, Galileo’s observation that large organisms must change 
shape in order to function in the same way as do their smaller prototypes. 

One quantitative measure of evolution is the development of the brain in mam¬ 
mals at various stages of evolution. Jerison" showed that the brain-body relation 
given by Eq. (2) is satisfied by mammals with an exponent that is statistically 
indistinguishable from 2/3. He suggested that a may be an appropriate measure 
of brain evolution in mammals as a class, following the proposal of Dubois 48 that 
a quantitative measure of cephalization in contemporary mammals be based on 
the ratio; a = brain weight/(body weight) 5 . These hypotheses were directly tested 
by Jerison" using endocranial volumes and body volumes for fossil mammals 
at early and intermediate evolutionary stages. The data did in fact support the 
hypothesis. 

White and Gould 253 emphasize in their review of the meaning of the allome¬ 
try coefficient a that it had generated a large and inconclusive literature. Reiss 176 
notes that if brain mass is regressed on TBM across individuals in a species the 
slopes are shallower than of regressions calculated across mean values for different 
species within a single family (genus). This argument had also been presented by 
Gould 80 who emphasized the importance of the allometry coefficient in the geomet¬ 
ric similarity of allometric growth. This interpretation of the allometry coefficient 
was at odds with the belief of the majority of the scientific community at the time 
that the allometry coefficient was independent of body size. This latter view is also 
contradicted by the data analysis in Sec. 3. 

Allometry has been used by Alberch et al 2 as the first step in creating a uni¬ 
fying theory in developmental biology and evolutionary ecology in their study of 
morphological evolution. They demonstrate how their proposed formalism relate 
changes in size and shape during ontogeny and phylogeny. 

3. Data Statistics 

In this section the phenomenology of analyzing AR measurement using statistics 
is discussed; collecting data, identifying patterns (laws) in the data and developing 
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methods of statistical analysis. Warton et al 220 point out that fitting a line to a 
bivariate data set is not a simple task and the AR literature is filled with debate 
over the proper methodology. Sir Julian readily adopted the statistical approach of 
linear regression to Eq. (3) on multiple data sets to determine the allometry coef¬ 
ficient a and exponent b. The sophisticated statistical techniques such as principle 
component analysis were not available to Sir Julian and although they can be found 
in the modern AR literature least-square regression still appears to be the method 
of choice. 73,189 

Measures of complex phenomena always contain uncertainty and the data sets 
invariably fluctuate from measurement to measurement. Thus, X(t) and Y(t) are 
stochastic variables, and by implication X(Y) is a random function of its argu¬ 
ment as well. Consequently the deterministic algebraic equations relating network 
size and network function never unambiguously represent what is actually being 
measured. Subsequently, we examine the corresponding mathematical modeling, 
shifting the focus from the dynamic stochastic variable to the associated dynamic 
pdf. 

We examine the phenomenology of the random data from measurements of var¬ 
ious properties of allometry networks that determine empirical ARs. In particular 
we focus on how allometry coefficients and exponents are interpreted given that the 
data on which they are based fluctuate to such a large extent. Part of the reason 
for taking this approach is that a significant number of scientists adopt the view¬ 
point that fluctuations reflect lack of control and/or ignorance about what is being 
measured. Here we adopt the more sympathetic view that networks are generically 
random because they are dynamic and complex in which case statistics provide 
information about the fundamental nature of that complexity. 


3.1. Fluctuations 

All complex dynamic networks are stochastic, either due to intrinsic nonlinear dy¬ 
namics producing chaos, 123,124,155 or due to coupling of the network to an infinite 
dimensional albeit unknown environment, 118 or both; completely aside from mea¬ 
surement errors. Consequently, it is necessary to understand how statistical un¬ 
certainty may be included in modeling allometry data. Kaitaniemi 104 pointed out 
that the potential information content of the allometry coefficient has been largely 
neglected, an observation also made by Glazier 73 among others. Kaitaniemi exam¬ 
ined the different ways this parameter may vary for different sources of random 
fluctuations. Here we follow a similar strategy, but we use actual data rather than 
computer generated random fluctuations. 

The normal or Gauss pdf suggests that the statistical variations between the 
variables in the AR Eq. (2) may be additive leading some scientists 70,130,203 to 
propose the form: 


Y = aX b + r ], 


(19) 
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where r] depicts the random fluctuations and the overbars denote the fitted values 
of the parameters. Packard and Boardman 157 also investigate the regression of data 
to a three-parameter power law that does not pass through the origin 

Y = Y 0 + aX~ b + 7]. (20) 

In the near isometry case where b ~ 1 linear regression analysis is appropriate 
and additive fluctuations provide a satisfactory representation of the statistical 
variability. On the other hand when the allometry exponent is substantially different 
from unity the determination of the nature of the fluctuations requires preliminary 
statistical analysis, see, for example, Packard. 156 

Packard and Boardman 157 emphasize that the additive form of fluctuations in 
ARs is quite different from the situation involving the logarithmically transformed 
data. For the transformed data introducing additive random fluctuations yields: 

logF = log a + blogX + r] , (21) 

and the empirical constants a and b are fit to the transformed data. In terms of the 
original AR we obtain: 

Y = ae v X l (22) 

with the fluctuations being exponentially amplified through e 77 . It is evident that 
when the fluctuations are considered to be focused in the allometry coefficient, as 
they are here, they are multiplicative. The multiplicative character of the fluctu¬ 
ations implies that the influence of the random variations is amplified far beyond 
their additive cousins. 

They emphasize that the focus of the research on log-transformed data is to 
characterize patterns of variation in morphology, physiology and ecology in organ¬ 
isms spanning a broad range in body size in an attempt to identify underlying 
principles in the design of biological networks, see, for example, Brown et al 27 and 
references therein. They go on to assert that many of the patterns identified by 
this research are inaccurate and misleading and these mischaracterizations likely 
contribute to the ongoing debate about ways in which animals are constructed. 

We saw that fluctuations in the log-transformed data may be equivalent to mul¬ 
tiplicative fluctuations in the original data. So the important question is whether it 
is necessary to perform the logarithmic transformation at all. Packard and Board- 
man 157 point out that the original motivation for the log-transform was to linearize 
the equations thought to represent the data and therefore facilitate the implemen¬ 
tation of graphical and statistical analysis. 162,203 However, they go on to show the 
biasing problems associated with log-transforms using computer generated data 
sets and caution that with the present day computer software for fitting nonlinear 
equations linearization is no longer a sufficient rationale for log-transforms. So are 
there other reasons to transform the data? 

Kerkhoff and Enquist 106 strongly disagree with the conclusions of Packard 
(2008) that standard methods for fitting allometry models produce “biased and 
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misleading” results. They point out that most biological phenomena are inherently 
multiplicative 63,66 and it is the proportional rather than absolute variation that 
matters. The multiplicative influence of noise seen in the log-transformed data is 
often misinterpreted as bias. 203,263 Kerkhoff and Enquist 106 maintain that the mul¬ 
tiplicative error model is an appropriate feature, rather than a defect, of standard 
allometry analysis. Recent research suggests that geometric error resulting from 
multiplicative fluctuations should be the default standard for parameter estimation 
in biology 66,81 and not additive error. 

3.2. Phenomenological distributions 

We now consider the statistics of the fluctuations in the AR using data from the 
literature. The data relating the average energy expended by a given species in watts 
to the average TBM of that species in kilograms for 391 species of mammal is plotted 
in Fig. 3 and also in Heusner 91 as well as in Dodds et a/. 47 A fit of Eq. (3) to these 
data that minimizes the mean-square error is a straight line on double logarithmic 
graph paper and was found to have slope b = 0.71 ± 0.008 so that empirically 
2/3 < b < 3/4 and the allometry coefficient a = 0.02. As West and West 245 reviewed 
Heusner 90 had somewhat earlier questioned Kleiber’s value of 3/4 and concluded 
from data analysis that this value of 3/4 was a statistical artifact. Feldman and 
McMahon 58 agreed with Heusner’s conclusions, but suggested that there was no 
compelling reason for the intraspecies and interspecies allometric exponents to be 
the same, with the intraspecies exponent being 2/3 based on geometric similarity 
and the interspecies exponent being 3/4 based on McMahon’s elastic similarity. 

There is a great deal of variability around the line segment that gives the AR 
model in Fig. 3. West and West 245 interpret these fluctuations as random variations 



Fig. 3. The linear regression to Eq. (3) for Heusner’s data 91 is indicated by the line segment. 
The slope of the dashed line segment is 0.71 ± 0.008. 
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in either the allometry coefficient or the allometry exponent. If the fluctuations are 
assumed to be contained in the allometry coefficient we can define 

£ = e n = JL 

a aM b 

so that each data point in the (X, T)-plane yields a single value of the allometry 
coefficient. In a complex network a linear response such as suggested by Fig. 3 does 
not necessarily occur since there can be independent fluctuations in both X and 
Y resulting in what Warton et a /. 220 call equation error; also known as natural 
variability, natural variation and intrinsic scatter. In the present case the AR is not 
predictive, but instead summaries vast amounts of data. 255 This natural variability 
is manifest in fluctuations in the allometry parameters (a, b). 

West and West 242 calculate the statistical distribution for the random allometry 
coefficient determined from Eq. (23) under the assumption that b is fixed. The 
variability in the allometry coefficient determined by the data is partitioned into 
20 equal sized bins in the logarithm of the allometry coefficient. A histogram is 
then constructed by counting the number of data points within each of the bins 
as indicated by the dots in Fig. 4. The solid line segment in this figure is the best 
fit to these 20 numbers with minimum mean-square error. The functional form 
for the histogram is indicated by the curve in Fig. 4 241 > 242 and the quality of the 
fit to the diversity data is determined by the correlation coefficient r 2 = 0.98. 
The normalized histogram G(lna') on the interval (0, oo) using the transformation 




Fig. 4. The histogram of the deviations from the prediction of the AR using the data depicted 
in Fig. 3 partitioned into 20 equal sized bins in the logarithm of the normalized variable a' = a/a. 
Here a = 0.02 and b = 0.71. The solid line segment is the best fit of Eq. (24) to the 20 histogram 
numbers, which yields the power-law index a = 2.79 and the quality of the fit is measured by the 
correlation coefficient r 2 = 0.98. (Reproduced from West and West 242 with permission). 
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G(lna')dlna' = P(a')da' gives the empirical pdf : 
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and a = 3.28 yielding a standard deviation 0.017 in essentially exact agreement 
with the empirical data. Note that this coefficient differs from the best fit value 
given in the caption of Fig. 4 but this results in only a 1% change in the value 
of r 2 . 

The same inverse power-law form is obtained with a = 3.89 and r = 0.96 using 
the avian BMR data of McNab 142 for 533 species of bird. The distribution of the 
deviations from the AR for both the avian and mammalian data sets fall off as 
inverse power laws on either side of a = a. Equation (24) quantifies the qualitative 
argument used earlier to associate power-law pdf’ s with multiplicative fluctuations, 
see also Sec. 5. 

Alternatively the fluctuations can be assumed to be contained within the allom¬ 
etry exponent as: 


log(-g/q) _ ^ 
log M 



If we assume b = 0.71 and a = a then Eq. (25) provides us with the statistical 
fluctuations in the allometry exponent are used to construct a histogram exactly as 
we did previously. The solid line segment in Fig. 5 is the best fit to the 20 numbers 
of the histogram with minimum mean-square error. The functional form for the 
histogram of deviations from the allometry exponent b is determined by the curve 
in Fig. 5 and the quality of the fit to the histogram is determined by the correlation 
coefficient r 2 = 0.97. The histogram is fit by the Laplace pdf 



^ exp [—/? | b 




with the empirical value /? = 12.85. 


3.2.1. Co-variation of allometry coefficient and exponent 

We have obtained two separate distributions for two different parametric representa¬ 
tions of the same data. As pointed out by Glaizer 73 species within a taxon represent 
a cloud of different metabolic rates and not the linear metabolic level determined by 
linear regression. His cloud is incorporated into the present context by abandoning 
the assumption that the allometry coefficient and exponent are independent and 
requiring that the probability of a given fluctuation is the same regardless of the 
representation so that V{a)da = ^(b)db. In order to calculate a nonzero Jacobian 
of the transformation between the two allometry parameters requires that they be 
functionally related. West and West 243 assume b = b — clna, so that by inserting the 
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Fig. 5. The histogram of the deviations from the prediction of the AR using the allometry 
exponent fluctuation data partitioned into 20 equal sized bins. The solid line segment is the best 
fit of Eq. (26) with Ab = b — b, to the 20 histogram numbers, and the quality of the fit is measured 
by the correlation coefficient r 2 = 0.97. (Reproduced from West and West 242 with permission). 

allometry exponent pdf Eq. (26) into that for equal probabilities and simplifying 
yields: 

for a < a 

(27) 

for a > a 

and comparing Eq. (27) with Eq. (24) they identify a = /3c and obtain the empirical 
pdf for the allometry coefficient P(a')da' = V(a)da. Using the fitted values a = 2.79 
and p = 12.85 yields c = 0.217 and consequently the empirical transformation can 
be written as: 

b = 0.71 — 0.50 log 10 a . (28) 

Note that this functional dependency of the allometry exponent on the allometry co¬ 
efficient is consistent with the empirical co-variation relation obtained by Glazier 73 
who used linear regression on large amounts of parametric data. West and West 243 
emphasize that this co-variation of the allometry parameters implies that there is 
no universal value for the allometry exponent. 

3.2.2. Other scaling distributions 

The tent shaped distribution in Fig. 5 also arises using a different approach to 
quantifying the variability of BMR. Labra et al 115 investigate BMR fluctuations 
by considering O 2 volume time series and examining the scaling of the high fre¬ 
quency fluctuations across species. They determined empirically that the standard 
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deviation in the BMR is proportional to the average BMR by the scaling law: 

SDbmr °c B X 

and A = —0.352 ± 0.072. This relation is known as the power curve in the ecology 
literature 85 and was discovered by Taylor 210 in his determination of the number of 
new species that can be found in a given plot of ground. It was expanded to the 
determination of scaling in time series by Taylor and Woiwold 211 and applied in 
physiology as Taylor’s law in West . 238 

On the other hand Labra et al 115 determined the standard deviation of BMR 
to be proportional to a power of the TBM: 

SDbmr °c M 1 

and 7 = —0.241 ±0.103. Consequently, combining the two empirical expressions for 
the variance we obtain: 

B oc M l/X (29) 

with 7/ A = 0.69 a value consistent with the many fits made to the allometry data. 
They determine that all the species they studied show the same invariant distribu¬ 
tion of BMR fluctuations, regardless of the difference in their phylogeny, physiology 
and body size. The distribution has the form Eq. (26) with the independent vari¬ 
able given by the fluctuation in the BMR A B and (3 = / S Dbmr; in terms of the 

scaling variable B sc = V2A£> /SDbmr curves for 12 different species collapse onto 
a single universal curve. 


3.2.3. Paleobiology and scaling 

Phenomena with intermittent properties described by inverse power-law statistics 
are apprarently ubiquitous 233 and paleobiology is no exception. Bak and Boettcher 8 
interpreted Charles Lyell’s 125 uniformitarianism as meaning that all geologic activ¬ 
ity should be explainable in terms of readily available processes working at all times 
and all places with the same intensity. They go on to argue that the existence of 
earthquakes, volcanic eruptions, floods and tsunamis all indicate that the physical 
world is not in equilibrium. Moreover the intermittent nature of the paleontological 
record indicates that macroevolution is also out of equilibrium and consequently 
the inverse power-law statistics are possibley suitable for their description. 

Eldredge and Gould 52 argued that punctuated change dominates the history of 
life and that relatively rapid episodes of speciation constitute biological macroevo¬ 
lution. The intermittency of speciation in time has been explained by one group 
as punctuated equilibria 51 and has been indirectly related to fractal statistics by 
identifying it as a self-organized critical phenomenon . 8 In the self-organized criti¬ 
cality model of speciation Bak and Boettcher 8 associate an avalanche of activity 
with exceeding a threshold and the distribution of returns to the threshold with a 
“devil’s staircase” having a distribution of steps of stasis of lengths given by the 
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inverse power-law pdf\ that is, T -7 with 7 = 1.75. Moreover, as explained by Snep- 
pen et al., 206 the number of genera IV, with a lifetime T can be fitted very well 
to a power law T~ ,s with f3 ~ 2. 173 More recently Rikvold and Zia 179 put forward 
an explanation of punctuated equilibrium based on 1// noise in macroevolutionary 
dynamics that also yields an inverse power-law pdf for the life time of ecological 
communities with (3 = 2 . 

Sole et al. 208 analyzed the statistics of the extinction fossil record (time series) 
and determined that the power spectrum has the form: 

s(f) cxi/r, 

with 0 < pb < 2. They find 0.80 < p < 0.90 and argue that these values support 
the self-organized criticality interpretation of extinction. On the other hand, Plot- 
nick and Sepkowski 165 also find a 1// power spectrum with a power-law index for 
extinction consistent with Sole et al. 208 and indices for species generation of ap¬ 
proximately half that for extinction. However the latter authors conclude that their 
results are incompatible with self-organized criticality and instead are compatible 
with multifractal self-similarity in both the extinction and generation records. 

4. Models of Allometry 

Two distinct methods dominate the many derivations of AR. One method is based 
on the first-principles reductionistic approach starting from an assumed form for the 
underlying mechanisms and from that deducing the necessity of Eq. (2). The other 
method is phenomenological involving statistical analysis and identifying patterns 
in the data analysis and from these patterns deducing the necessity of the empirical 
AR. A number of the statistical methods were reviewed in the previous section. The 
ARs stand out as empirical relations that have withstood the test of time, whereas 
the same cannot be said for the models developed to explain how they come about. 
In this section we examine attempts to formulate general principles from which the 
underlying mechanisms, whether reductionist or statistical, producing the ARs can 
be identified. 

The search for an unifying principle parallels such historical activities as the 
determination of “least action” in analytic mechanics or optimization in control 
theory. More recently investigators have rediscovered Newton’s “principle of simil¬ 
itude’” introduced in the Principia (II, Proposition 32). Scaling and the princi¬ 
ple of similitude have been present in the study of complex physical phenomena 
since physics became a science. In modern times it is RG theory that provides a 
formalism for determining how force is transferred across multiple scale. 103,111,259 
Part of the reason for exploring this approach is that fractal geometry and frac¬ 
tal statistics are able to capture some of the regularity observed in vast amounts 
of data in the life and social sciences in addition to the physical sciences. The 
implementation of fractal geometry and RG theory to study the architecture of 
physiological forms, 17,228,247 interacting networks of chemical reactions 45,78,169 and 
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the topology of ecological webs 27 over the past quarter century has lead to some 
remarkable insights. In particular the descriptive success of investigations using frac¬ 
tals 15,131,135,143 suggests that branching networks with fractal architectures have an 
evolutionary advantage 230,231,235 as do fractal stochastic processes . 236 

4.1. Optimization principles 

Optimization provides one of the best recipes for determining network dynamics 
consistent with a given set of constraints. It is through the specification of the 
constraints that insight into a phenomenon is introduced. The biological sciences 
employ ideas such as energy minimization; the minimal use of materials along with 
the maximization of efficiency. So whether the extremum is a minimum or maximum 
depends on one’s purpose and the quantity being varied. 


4.1.1. Energy minimization 

Scaling relations in living networks result from the balancing of various constraints. 
One such biological balancing is that of energy utilization against the energy cost of 
carrying out a biological function. A technique that has been used both implicitly 
and explicitly in the derivation of ARs is supplying nutrients to various parts of 
an organism through the venous and capillary networks as well as through the 
respiratory network. Murray 150 considered a fluid with viscosity v and laminar flow 
Q within a tube of length l and radius r. The flow had to overcome the vascular 
resistance that generates a pressure difference along the length of the tube given 
by Poiseuille’s law A p = SlnQ/irr 4 . 

A constraint on the flow is the cost of transporting fluid of cylindrical volume 
V = 7r lr 2 along the tube so that introducing c as the cost factor the total work to 
be done per unit time is given by: 

E = QAp + cV = ^ —j- cttIv 2 . (30) 

7rr 4 

Consequently, minimizing this expression with respect to the radius yields the opti¬ 
mal flow: Q = Cr 3 with the constant C = ctt 2 / 16n. The cubic dependence of the 
flow rate on radius is known as Murray’s law 223 and is indicative of the maximum 
efficiency of the flow. 

Murray 151 subsequently extended his result to bifurcating networks such as 
occurs in bronchial airways. The flow from a parent vessel of radius ro branches 
into two daughter vessels rq and rq such that the flow divides: 

Q o = Q i + Q 2 • 

Therefore inserting Murray’s law into this expression yields da Vinci’s equation with 
a = 3 and in the case of equal radii in the daughter branches rq = rq we obtain 
the scaling relation rq = 2 - 1 / 3 ro. Thus, the maximally efficient bifurcating network 
in terms of energy transport cost has radii decreasing as the cube root of two. As 
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Fig. 6. Sketch of a branching structure such as a blood vessel or bronchial airway with the 
parameters used in a bifrucating network model. 


pointed out by Weibel 223 this last result is also known as Murray’s law. However 
this law was actually first formulated by Hess 89 for blood vessels and subsequently 
is more properly called the Hess-Murray law. 


4.1.2. Optimal design 

Rashevsky 172 introduced the principle of optimal design in which the material used 
and the energy expended to achieve a prescribed function is minimal. He applied the 
principle to the basic problem of how the arterial network could branch in space 
in order to supply blood to every element of tissue. To address this problem he 
used the model of a bifurcating branching network supplying blood to a restricted 
volume and reducing the total resistance to the flow of blood. His purpose was to 
determine the condition imposed by the requirement that the total resistance is 
minimum. 

Here we assume the branching network is composed of N generations from the 
point of entry (0) to the terminal branches (TV). A typical tube at some intermediate 
generation k has length radius rk and pressure drop across the length of the 
branch A p^ as sketched in Fig. 6. The volume flow rate Qk is expressed in terms 
of the flow velocity averaged over the cross sectional area Uk : Qk = 7rrj \uk- Each 
tube branches into n smaller tubes with the branching of the vessel occurring over 
some distance that is substantially smaller than the lengths of the tubes of either 
generation. Consequently, the total number of branches generated up to generation 
k is Nk = n k . The pressure difference at generation k between the ends of a tube 
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is given by a suitably indexed version of Poiseuille’s law and the total resistance to 
the flow is given by the ratio of the pressure to flow rate 


n 


k = 


A p k 

Qk 


8 vlk 

4 

k 



The total resistance for a network branch with m identical tubes in parallel is 
1/m the resistance of each individual tube. Thus, in this oversimplified case we can 
write the total network resistance as: 


Q 


T — 




In order to minimize the resistance for a given mass Rachevsky first expressed the 
initial radius ro in terms of the total mass of the network. The optimum radii 
for the different branches of the bifurcation network having the total mass M are 
then determined such that the total resistance is a minimum dflr/drj = 0 yielding 
the equality ry = N k l ^r$. The ratio of the radii between successive generations 
is Vk+i/rk = (Ny/Ny+iY^so that inserting the number of branches at the kth 
generation Ny = n k yields: 

r k+1 /r k = n~ 1/3 , (33) 


yielding an exponential reduction in the branch radii across generations. Note the 
formal similarity of this ratio to Horton’s law in which the ratio of numbers of river 
branches is independent of generation number. 

Rashevsky considered the bifurcating case n = 2 where the ratio of radii reduces 
to ry+i/ry = 2 -1 / 3 = 0.794. This is the classic “cube law” branching of Thomp¬ 
son 213 in which he used the “principle of similitude”. The value 2 -1 / 3 was obtained 
by Weibel and Gomez 222 for the reduction in the diameter of bronchial airways for 
the first ten generations of the bronchial tree. However they noted a sharp devia¬ 
tion away from this constant fractional reduction after the tenth generation. The 
value 2 -1 / 3 was also obtained by Wilson 260 who explained the proposed exponen¬ 
tial decrease in the average radius of a bronchial tube with generation number by 
showing that this is the functional form for which a gas of given composition can 
be provided to the alveoli with minimum metabolism or entropy production in the 
respiratory musculature. He proposed minimum entropy production 69 as the design 
principle for biological networks to carry out a given function. 

The deviation from classical (exponential) scaling above generation ten shown in 
Fig. 7 was eventually explained using an alternative model of the bronchial airways 
in terms of fractal statistics 228 as we subsequently discuss. 


4.2. Metabolic allometry 

Barenblatt and Monin 14 proposed that metabolic scaling might be a consequence 
of the fractal nature of biology, but they did not provide a mechanistic model for its 
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Fig. 7. As the bronchial tree branches out, its tubes on average decrease in size. A theory con¬ 
sistent with the Principle of Similitude 213 predicts that their diameters should decrease by about 
the same ratio from one generation to the next; exponential decline. This semilog graph shows 
measurements from Weibel and Gomez 222 for 23 generations of bronchial tubes in the human 
lung. The prediction is a straight line that fits the anatomic data (dots) until about the tenth 
generation, after which they deviate systematically from an exponential decline, [adapted from 
West and Goldberger. 229 ] 

description. This shortcoming has been overcome by a number of investigators who 
have devised numerous fractal models to describe AR in a variety of contexts. 228,247 


4.2.1. Elastic similarity model 

The first model to analytically predict the allometry exponent 3/4 was constructed 
by McMahon. 139,140 His argument rests on the observation that the weight of a col¬ 
umn increases more rapidly with size than does its strength. Moreover as discussed 
by Schmidt-Nielsen 195 if the column is tall and slender it can fail due to elastic 
buckling in which small lateral displacements exceed the elastic restoring forces. 
For a sufficiently slender column with Young’s elastic modulus E and density p the 
critical length of a column of diameter d is: 

l c r = k(E/p) 1/3 d 2/3 (34) 

and k is a constant. The elastic criteria of McMahon is therefore given by l 3 oc d 2 . 
The weight of the column is given by the product of the density, length and cross- 
sectional area, 

Mg = pd 2 / 3 7rd 2 /4, 

where the length has been replaced using the elastic criteria yielding d oc M 3 / 8 . 
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The implications for allometry of the scaling between cylinder diameter and 
mass was discussed by Calder 31 using the symmorphosis hypothesis of Taylor and 
Weibel : 212 .. is regulated to satisfy but not exceed the requirements of the func¬ 
tional system ”. 237 First off we recognize that locomotion requires the contraction 
of muscles. During contraction, muscles exert a force that increases with the cross- 
sectional area of the muscle. The power output of the muscle is the work done per 
unit time and may be equated with the metabolic rate, the product of the force 
generated by the muscle and the velocity u of the shortening of the muscle B oc d 2 u. 
The velocity of the shortening of the muscle appears to be a size-independent con¬ 
stant from species to species 92 so that using these two equations for the diameter 
yields: 

B oc M 3/4 . 

Consequently, the allometry exponent 3/4 in McMahon’s elastic similarity model 
is required to maintain the flow of energy to the working muscles and is consistent 
with the principle of symmorphosis. 

Versions of the above argument given by both Calder 31 and Schmidt-Nielsen 195 
seem to explain the value of the allometry exponent for warm blooded animals. 
Dodds et a /. 47 critique McMahon’s model by noting that there is no compelling 
reason why the power output of muscles should be the dominant factor in the scaling 
of BMR. Moreover, Savage et al 189 point out that while the elastic similarity model 
might apply to the bones of mammals or the trunks of trees that have adapted to 
gravitational forces, it is doubtful that it is applicable to aquatic or unicellular 
organisms that also display an allometry exponent of 3/4 . 87 

4.2.2. WBE model 

West, Brown and Enquist 247 (WBE) published a quantitative model of metabolic 
AR that has had significant impact on how a significant fraction of today’s bi¬ 
ology/ecology community understands metabolic ARs. WBE model nutrient dis¬ 
tribution within a hierarchal network in which vessels become narrower, shorter 
and more numerous between successive levels proceeding from the initial to the 
terminal level reminiscent of representations of river branchings. The scaling in 
the transport network is a consequence of the constraints imposed by three as¬ 
sumptions: (1) The entire volume of the organism is crammed with a space-filling 
branching network. (2) The tube properties at the terminus of the network are 
size-invariant. (3) The energy required to distribute resources using this network 
is minimal, that is, the hydrodynamic resistance of the network is minimized. We 
note their claim that this model is the origin of universal scaling laws in biol¬ 
ogy 248,249 with b = 3/4. However we recall at the start that the existence of an 
empirical exponent b = 3/4 for metabolic AR has been questioned by numer¬ 
ous investigators 47,91,113,114,128 and we address these concerns and others in due 


course. 
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WBE introduce two parameters to characterize the network branching process, 
Pk = rk+i/rk the ratio of successive branch radii, as was done in the energy mini¬ 
mization arguments 172 and the other is 7 *, = Ik+i/hi the ratio of successive branch 
lengths. They conclude that the total number of terminal branches scales with TBM 
as Nt = (M/Mq) 5 . The total number of branches at level k is Nk = n k . Conse¬ 
quently at the network terminus the self-similarity of the network yields Nt = n N 
and consequently the total number of branches is: 


N = Mn(M/M 0 )/lnn. (35) 

Rashevsky’s energy minimization argument indicates that the transport of nutrients 
in complex networks is maximally efficient when pk is independent of k. This is the 
“fractal” scaling assumption made by WBE. 

The estimates of the ratio parameters required two separate assumptions. To 
estimate the ratio of lengths WBE assume that the volume of a tube at generation k 
can be replaced by a spherical volume of diameter Ik and in this way implement the 
space-filling assumption. The conservation of volume between generations therefore 
leads to the space filling condition: 

{h+i/h) 3 ~ Nk/Nj, c+i = n 1 


and consequently to the /^-independent parameter 


7 = 7 k=n 



They maintain that this level-independent scaling of the lengths is a generic prop¬ 
erty of all the space-filling networks they consider. This condition for an n-branching 
network with a reduction in size of n -1 / 3 between successive generations yields us¬ 
ing the definition for the fractal dimension D = log nj log n 1 / 3 = 3. 

A separate and distinct assumption is made to estimate [3 using the classic 
rigid-pipe model to equate the cross-sectional areas between successive generations: 
7rr 2 = n7rr 2 +1 , so that we have 

P = Pk = n~ 1/2 . 


Note that this scaling of /3 differs from the ratio parameter obtained using energy 
minimization. Inserting these values of the scaling parameters into the expression 
for b yields the sought after exponent b = 3/4. This value of the exponent in turn 
determines a number of other quarter-power scaling laws. 

WBE point out that this is strictly a geometrical argument applying only to 
those networks that exhibit area-preserving branching. Moreover the fluid velocity 
is constant throughout the network and it is independent of size. They go on to 
say that these features are a natural consequence of the idealized vessel-bundle 
structure of plant vascular networks in which area-preserving arises automatically 
because each branch is assumed to be a bundle of n N ~ k elementary vessels of the 
same radius. They recognized that this is not the situation with vascular blood 
flow where the beating of the heart produces a pulsating flow that generates a very 
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different kind of scaling. Area-preserving is also not true in the mammalian lung 
where there is a distribution of radii at each level of branching. 

A physical property that the area preserving condition violates is that blood 
slows down in going from the aorta to the capillary bed. Here WBE return to the 
principle of energy minimization and as stated by West 248 assert that to sustain a 
given metabolic rate in an organism of fixed mass, with a given volume of blood, 
the cardiac output is minimized subject to a space-filling geometry. This variation 
is essentially equivalent to minimizing the total impedance since the flow rate is 
constant and again yields the Hess-Murray law (3 = n -1 / 3 corresponding to area- 
increasing branching. 6,190 This change in scaling from the area-preserving n -1 / 2 
to the area-increasing n -1 / 3 solves the problem of slowing down blood flow to 
accommodate diffusion at the capillary level. Moreover, the variation also leads to 
an allometry exponent 6=1. Such an isometric scaling (6=1) suggests that plants 
and animals follow different allometry scaling relations as was found. 174,201 

A detailed treatment of pulsate flow is not straightforward and will not be 
presented here, but see Savage et al. 190 ; Silva et al 201 and Apol et al. 6 for details 
and commentary in the context of the WBE model. We do note that for blood flow 
the walls of the tubes are elastic and consequently the impedance is complex, as is 
the dispersion relation that determines the velocity of the wave and its frequency. 
Consequently pulsate flow is attenuated 32,60 and WBE argue that the impedance 
changes its r-dependence from r~ 4 for large tubes to r~ 2 for small tubes. The 
variation therefore changes from area-preserving flow /3 = n -1 / 2 for large vessels to 
dissipative flow (3 = n -1 / 3 for small vessels where blood flow is forced to slow. Thus 
[3k is /c-dependent in the WBE model for pulsate flow and at an intermediate value of 
k the scaling changes and this changeover value is species dependent. These results 
are contradicted in the more extensive analysis of pulsate flow by Apol et a/., 6 who 
conclude that Kleiber’s law remains theoretically unexplained. 

Kozlowski and Konarzewski 113 critique the apparent limitations of the WBE 
model assumptions. The size-invariance assumption has been interpreted by them 
to mean that Nt oc M, that is, the terminal number of vessels scales isometrically 
with size and consequently causes the number of levels to be a function of body 
size since more levels are required to fill a larger volume with the same density of 
final vessels. However Brown et al 28 assert that NtVt oc M so that 

N t oc M 3/4 (36) 

to which Kozlowski and Konarzewski 114 respond that such mass dependence is 
an arbitrary assumption and is not proven, see also Dawson. 43 Etienne et a/. 54 
reconstruct the WBE model without making the self-similarity assumption and in 
so doing satisfy the concerns of Kozlowski and Konarzewske. 113,114 The arguments 
remain unresolved and Cyr and Walker 38 refer to the assumption embodied in 
Eq. (36) as the illusion of mechanistic understanding and maintain that after a 
century of work the jury is still out on the magnitude of the allometry exponents. 
Riisgard 180 argues that respiration and growth are integrated through the energetic 


148 


D. West & B. J. West 

costs of growth and that this explains why the b value is not a “natural constant” 
and that a “3/4 power scaling law cannot be deduced from the interplay between 
pure physical and geometric constraints of the transport of oxygen. 

A quite different critique comes from Savage et al 190 who emphasize that the 
WBE model is only valid in the limit TV oo, that is, for infinite network size (body 
mass) and that the actual allometry exponent predicted depends on the sizes of the 
organisms considered. They calculate that the AR between BMR and TBM have 
corrections for finite N given by: 

M = ai B + a 2 B A/3 (37) 

from which it is clear that 6 = 3/4 only occurs when a 2 B x ! 3 2> ai. This inequal- 
ity is not satisfied for bodies of finite size. In their original publication WBE ac¬ 
knowledged the potential importance of such finite-size effects, especially for small 
animals, but the magnitude of the effect remained unspecified. Using explicit ex¬ 
pressions for the coefficients in the WBE model Savage et al 190 show that when 
accounting for these corrections over a size range spanning the eight orders of mag¬ 
nitude observed in mammals a scaling exponent of b = 0.81 is obtained. Moreover in 
addition to this strong deviation from the desired value of 3/4 there is a curvilinear 
relation between the TBM and the BMR in the WBE model given by: 

In M = In <22 H— In B + In ( 1 4—- B _1 ^ 3 

3 V a 2 

whose curvature is opposite to that observed in the data. Consequently they con¬ 
clude that the WBE model needs to be amended and/or the data analysis needs 
reassessment to resolve this discrepancy. A start in this direction has been made 
by Kolokotrones et al 110 Agutter and Tuszynski 5 also review the evidence that the 
fractal network theory for the two-variable AR is invalid. 

Another variation on this theme was made by Price et al 168 who relax the 
fractal scaling assumptions of WBE and show that allometry exponents are highly 
constrained and covary according to specific quantitative functions. Their results 
emphasize the importance of network geometry in determining the allometry expo¬ 
nents and supports the hypothesis that natural selection minimizes hydrodynamic 
resistance. Moreover they extended McMahon’s elastic similarity model and ap¬ 
ply it to plant scaling exponents showing its consistency with their modification of 
WBE. 

Prior to WBE there was no unified theoretical explanation of quarter-power 
scaling. Banavar et al. 12 show that the 3/4 exponent emerges naturally as an upper 
bound for the scaling of metabolic rate in the radial explosion network and in 
the hierarchical branching networks models and they point out that quarter-power 
scaling can arise even when the underlying network is not fractal. 

Finally, Weibel 224 presents a simple and compelling argument on the limitations 
of the WBE model in terms of transitioning from BMR to the maximal metabolic 
rate (MMR) induced by exercise. The AR for MMR has an exponent b = 0.86 
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rather than 3/4, so that a different approach to determining the exponent is needed. 
Painter 158 demonstrates that the empirical allometry exponent for MMR can be 
obtained in the manner pioneered by WBE by using the Hess-Murray law for the 
scaling of branch sizes between levels. 

Weibel 224 argues that a single cause for the power function arising from a fractal 
network is not as reasonable as a model involving multiple causes, see also Agut- 
ter and Tuszynski. 5 Darveau et al 41 propose such a model recognizing that the 
metabolic rate is a complex property resulting from a combination of functions. 
West et al 251 and Banavar et al. 11 demonstrate that the mathematics in the dis¬ 
tributed control model of Darveau et al. A1 is fundamentally flawed. In their reply 
Darveau et alf 2 do not contest the mathematical criticism and instead point out 
consistency of the multiple-cause model of metabolic scaling with what is known 
from biochemical 205 and physiological 101 analysis of metabolic control. The notion 
of distributed control remains an attractive alternative to the single cause mod¬ 
els of metabolic AR. A mathematically rigorous development of AR with fractal 
responses from multiple causes was recently given by Vlad et al 218 in a general 
context. This latter approach may answer the formal questions posed by many of 
these critics. 


4.3. Why fractal transport? 

Why are fractals important in the design of allometry networks? Barenblatt and 
Monin 14 suggested that metabolic scaling might be a consequence of the fractal na¬ 
ture of biology and WBE determined that fractal geometry maximizes the efficiency 
of nutrient transport in biological networks. Weibel 223 maintains that the fractal 
design principle can be observed in all manner of physiologic networks quantifying 
the observations and speculations of Mandelbrot, 131,132 as does West. 238 

West 230 conjectured that fractals are more adaptive to internal changes and to 
changes in the environment than are classical processes and structures. Consider a 
network property characterized by classical scaling at the level k such as the length 
or diameter of a branch Fy oc e Xk compared with a fractal scaling characterization 
of the same property Fy oc k~ x . What is significant in these two functional forms for 
the present argument is the dependence on the parameter A. The exponential has 
emerged from a large number of optimization arguments and the inverse power-law 
results from RG scaling arguments. 

Assume the parameter A is the sum of a constant part Ao and a random part £. 
The random part can arise from unpredictable changes in the environment during 
morphogenesis, non-systematic errors in the code generating the physiologic struc¬ 
ture or any of a number of other causes of irregularity. Thus, regardless of whether 
the errors are induced internally or externally, the average is taken over an ensemble 
of zero-centered Gaussian fluctuations £ with variance a 2 /2. Note that the choice 
of Gauss statistics has no special significance here except to provide closed form 
expressions for the averages to facilitate discussion. The relative error generated by 
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Fig. 8. The error between the model prediction and the prediction averaged over a noisy param¬ 
eter is shown for the classical model (upper curve) and the fractal model (lower curve). 


the fluctuations is given by the ratio of the average value to the function in the ab¬ 
sence of fluctuations, yielding the relative error for classical scaling £k = exp[cr 2 k 2 
and for fractal scaling = exp[cr 2 (ln k) 2 }. 

The two error functions are graphed in Fig. 8 for fluctuations with a variance 
cr 2 =0.01.At/c = 15 the error in classical scaling is 9.5. This enormous relative error 
means that the perturbed average property at generation 15 differs by nearly an 
order of magnitude from what it would be in an unperturbed network. A biological 
network with this sensitivity to error would not survive for very long in the wild. For 
example, the diameter of a bronchial airway in the human lung could not survive 
this level of sensitivity. However, the property of the fractal network only changes 
by 10% at the distal point k = 20. The implication is that the fractal network is 
relatively unresponsive to fluctuations. 

A fractal network is consequently very tolerant of variability. This error tolerance 
can be traced back to the broadband nature of the distribution in scale sizes of a 
fractal object. This distribution ascribes many scales to each generation in the 
network. The scales introduced by the errors are therefore already present in a 
fractal object. Thus, the fractal network is preadapted to variation and is therefore 
insensitive to change. 230,231 These conclusions do not vary with modification in the 
assumed statistics of the errors. Therefore let us review how fractal statistics have 
been employed in the understanding of some nonmetabolic physiologic ARs. 


4.3.1. WBG bronchial tree 

The energy minimization argument applied to a branching network resulted in 
Horton’s law. However, the theoretical arguments justifying this empirical law and 
others like it assume that the network is geometrically self-similar. Real networks, 
such as the bronchial tree in the mammalian lung, do not have such determinis¬ 
tic regularity. At any generation of the bronchial airway there is a distribution of 
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lengths and diameters, with a reduction in the average length and average diam¬ 
eter between successive generations. Consequently, the simple characteristic scale 
governing the decrease in bronchial dimensions across generations given by the 
Hess-Murray law must be reexamined. West, Barghava and Goldberger 228 (WBG) 
assumed the mammalian lung to be a fractal structure having a distribution of 
scales contributing to the variability in tube size at each generation. This model 
yields an average bronchial diameter that decreases with generation number, not 
as an exponential as suggested by Weibel and Gomez 222 but as an inverse power 
law . 229 ’ 232 ’ 233 

WBG assume that the variability in the diameters of the bronchial tubes at 
generation k is given by ^(yfc), where 7 is a random variable that determines the 
size of the diameter. In the energy minimization argument we determined that 
7 = In 2/3 and the classical reduction in diameter with generation number was 
exponential d(pfk) = d^e~ lk as shown in Fig. 7. However with real data this is the 
reduction in the average diameter just as it was for other allometry networks so that 
the random variation in diameter at generation k is smoothed over to determine the 
average. Formally the behavior of the average diameter as a function of generation 
number is given by: 

d{k) = J d(^k)p(^)d~/ , (38) 

with £>( 7 ) the pdf in scale size. Rather than prescribe a particular functional form to 
this pdf in the formal definition WBG use a RG argument originally developed in an 
economic context by Montroll and Shlesinger . 148 Assuming the original distribution 
of scales has an average value 7 the RG argument constructs a new distribution with 
an infinite number of new scales each a factor of [3 larger than the preceding scale 
and each occurring with a relative frequency ( less often. The distribution resulting 
from the RG relation gives rise to an empirical average denoted by brackets in terms 
of the formal average Eq. (38) as follows 234,235 : 

(#))=((« + (l-0#). (39) 

The dominant behavior for the solution to the RG relation Eq. (39) is determined 
by the singular part of the solution to the equation, which for simplicity WBG write 
with a subscript (d(k)) s = ((d(/3k)) s . Shlesinger and West 200 note that the solution 
to this scaling equation that provides the best fit to the data can be written as the 
real part of: 

00 A 

(d(k)) s = ^2 -jft’ = b + i2nn/ In/?; (40) 

n— — oo 

fi n is the complex fractal dimension of order n for the branching process, with the 
exponent b = — ln£/ln /? > 0 and the A n are fit by experimental data. Of course 
there is in addition to Eq. (40) an analytic part to the solution that WBG calculated 
but which becomes negligible with increasing k. 
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Thus, the average diameter is an inverse power law in the generation number 
modulated by a slowly oscillating function just as observed in the data depicted in 
Fig. 9. This fractal model of the bronchial airway provides an excellent fit to the 
average bronchial tube diameter data in four distinct species: dogs, rats, hamsters 
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Fig. 9. Top: Reploted data from Fig. 7 on log-log scales for humans. Center: The harmonic 
variation in measurements for dogs, rats and hamsters taken by Raabe et al. (1976). Bottom: The 
average diameters for the bronchial tubes of a hamster and RG (fractal) model prediction using 
Eq. (40). [Adapted from West and Goldberger 228 ]. 
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and humans. 152 The quality of the fit is shown for hamsters in Fig. 9, which restricts 
the summation index in Eq. (40) to n = 0,1,—1 and A\ = A-\ yields a single 
oscillation of the modulation function fit to the data. 

The form of Eq. (40) strongly suggests that the RG relation for the average di¬ 
ameter captures a fundamental property of the structure of the lung that is distinct 
from classical scaling. There is an AR at each level of the bronchial tree with the 
average diameter decreasing as k~ b with the level being a measure of size. Moreover 
the data show the same type of scaling for bronchial tube lengths and consequently 
volume. 

4.3.2. Distribution of bronchial diameters 

Kitaoka and Suki 107 use a related but distinct analysis of the lung data depicted 
in Fig. 9 in which they investigate the statistical variability of the bronchial airway 
diameters. Their analysis was based on the assumed relation between the airflow 
rate and the airway diameter d : 

Q = Cd a , 

a relation discussed in terms of energy optimization by Murry 150 yielding a = 3. 
The relation was also considered in a bronchial airway context by a number of other 
investigators. 82,105,163 The data of Raabe et al 171 reveals that the flow rate is a 
random variable and that the cumulative probability for the flow rate to exceed Q 
is given by: 

Pr(> Q) a 1 /Q 

with an inverse power-law index of —1 to three significant figures. Note that from 
the additive form of the flow rate between successive generations and the da Vinci 
relation between parent and daughter branches that the pdf for the diameter of the 
bronchial airway is determined to be: 

P (d)=cd , c= a f° dT r (4i) 

with do the diameter of the thorax and dr the diameter of the smallest bronchial 
airway. Here again the empirical distribution for the observable, the bronchial tube 
diameter rather than the allometry coefficient, is an inverse power-law (Pareto) pdf 
Note that in the previous subsection the average diameter was expressed as an 
inverse power law in the generation number. The distribution in the size of the 
bronchial airway Eq. (41) does not directly depend on generation number and is an 
inverse power law in diameter size, that is, in millimeters as depicted in Fig. 10. The 
data used to construct Fig. 9 was also used to determine the cumulative distribution 
of airway diameters and yields the inverse power-law distribution with a = 3.1 with 
correlation coefficient of r 2 = 0.988. Note that this value of the exponent is slightly 
different from the optimum as we discuss in the next section. It is also noteworthy 
that the oscillations around the inverse power law given by the straight line are the 
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Fig. 10. Cumulative distribution of diameters on log-log plot. The regression model over the full 
range of diameters yields a = 3.1 with r 2 = 0.988. [Reproduced from Kitaoka and Suki 107 with 
permission]. 

same periodic variation explained using WBG theory except that the oscillations 
are now a function of distance rather than generation number. 


4.3.3. The optimum can be dangerous 

A question of interest is whether optimal design criteria are ever realized in nature 
and whether they are even desirable. The WBE model suggests that the allometry 
exponent value of 3/4 is proof of the optimally of fractal design of networks for nu¬ 
trient transport. However the controversy over the empirical value of the allometry 
exponent calls this implication into question. The present discussion regarding the 
mammalian lung and whether the bronchial tree is optimal has been investigated 
by Mauroy et al 138 They maintain that the bronchial tree of most mammalian 
lungs is a good example of an efficient distribution network with an approximate 
fractal structure. 152,228 They state that physical optimization is critical in that 
small variations in the geometry can induce large variations in the net air flux and 
consequently optimality cannot be a sufficient criterion for physiologic design of the 
bronchial tree. The slight deviations observed in the parameters presumed to be 
optimized are a manifestation of a safety factor being encorporated into the design 
and into the capacity for regulating airway caliber. 
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In the present context the size ratio h of successive airway segments are homoth- 
etic with h = 2- 1 / 3 «0 .79 as discussed earlier. Homothetic scaling means that the 
lengths and diameters have the same ratios between successive generations. Using 
the resistance minimization argument for the bronchial network Mauroy et a/. 138 
show that the “best” bronchial tree is fractal with constant reduction factor given 
by the Hess-Murray law. Do the data support this optimal value and if not what 
does that imply about the efficiency of bronchial airways? 

The fractal dimension for a bronchial airway is D = — In 2/ In h so that the 
Hess-Murray law implies D = 3 whereas h > 0.79 implies D > 3. In the human 
lung it is found that the homothety ratio is h ~ 0.85 224 and the bronchial network 
is therefore not optimized: its volume is too large and its overall resistance is too 
small. Mauroy et a/. 138 emphasize that this deviation from optimal is, in fact, a 
safety margin for breathing with respect to possible bronchial constrictions. 

Sapoval 187 has argued that without regulation of the airway caliber 170 there 
would be a multifractal spatial distribution of air within the lungs, resulting in 
strongly nonuniform ventilation with some regions of the lung being poorly fed 
with fresh air. Expanding on this theme using inhomogeneity of the homothety 
ratio Mauroy et ai 138 show how the optimal network is dangerously sensitive to 
physiological variability and consequently design of the bronchial tree must incor¬ 
porate more than just physical optimality. This argument has clear implications for 
other allometric networks. 

5. Probability Calculus and ARs 

A fractal processes is one that is rich in scales with no one scale being dominant. 
Thus, information in fractal phenomena is coupled across multiple scales manifest¬ 
ing long-time memory, as for example, observed in the architecture of the mam¬ 
malian lung, 152,223,228 manifest in the long-range correlations in human gait 84,235 
and measured in the human cardiovascular network, 161 all of which are discussed in 
West. 238 These phenomenological characterizations of fractal time series relate back 
to the observation made earlier that if X (t) and Y (t) are stochastic time series then 
Y(X) must be a random function of its random argument as well. Consequently 
the general approach to determining the relation between such variables is through 
their pdf' s. 

In this section we examine pdf 's that can produce the interspecies metabolic 
ARs as a relation between moments. To do this requires a brief review of some 
methods from nonequilibrium statistical physics 118,175 and their extension to frac¬ 
tional equations of motions. 108,236 

5.1. Stochastic differential and Fokker-Planck equations 

There are two major techniques available in statistical physics for the modeling of 
stochastic phenomena. The first technique uses the dynamic equations constructed 
by Langevin 117 who introduced uncertainty through a random force in the equations 
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of motion. The second technique is based on the phase space evolution of the pdf 
using the Fokker-Planck equation (FPE). The conditions under which these two 
methods are equivalent have been shown in a number of places, see, for example, 
Lindenberg and West. 118 


5.1.1. Additive fluctuations 

Consider the dynamics of a simple exponential relaxation process Z(t) that is dis¬ 
rupted by a random force £(£): 

ZA1 = _ X Z{t) + m . (42) 

We assume the statistics of the random force are normal with a delta correlated 
correlation of strength D 

mat')) = 2 DS(t -1'). (43) 


An alternative description of the dynamics is given by the evolution of the prob¬ 
ability density in phase space where P(z , t)dz is the probability that the dynamic 
variable Z{t) has a value in the interval (z, z + dz) at time t given a value q{t) at 
the origin z = 0. The phase space dynamics are formally expressed by the equation: 

G (J? Wz) P<yZ ’^ =q ^ 6 ^- ( 44 ) 


The analytic function G(-, •) of the indicated operators along with the inhomoge¬ 
neous term q(t) determine the dynamics of the process modeled by the pdf 

A phase space equation for a classical diffusion process with linear dissipation 
equivalent to Eq. (42) is determined by: 


G 


d d 


) = f- 

d 

d ' 

A z + D ^ 

J dt 

dz 

dz 


dt ’ dz / 

The corresponding FPE is given by 217 : 


and q{t) = 1 



dP(z,t\zoflo) d 


dt 


dz 


Xz T D 


d_ 

dz 


P(zfl\z 0 flo ), 



where P(zfl\zoflo)dz is the probability that the dynamic variable lies in the phase 
space interval (z + dz,z) at time t conditional on Z( 0) = zq at time t = to. The 
solution to Eq. (46) for to = 0 is given by: 


P(z,t\z 0 ) = 


1 


y / 47rcr 2 (t) 


exp 


2 d 2 (t) 


(47) 


with average value (z;t) = zoe~ xt and variance cr 2 (t) = D/ A(1 — e~ xt ). In the 
absence of dissipation the average value is constant and the variance grows linearly 
in time. However for finite dissipation the average decays exponentially in time 
and the variance approaches a constant value given by the ratio of the diffusion 
coefficient and the dissipation rate. If Z is the velocity of a Brownian particle the 
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variance would be proportional to the temperature of the surrounding fluid resulting 
in the Einstein relation D /A = AbT, see, for example, Flirth . 61 

Multiplicative fluctuations can be addressed by examining the FPE associated 
with a linear stochastic dissipation parameter . 118,193 Consider the nonlinear rate 
equation with multiplicative fluctuations 

dx 

— = -AXln X + Z{t)X (48) 

for which the logarithmic transformation Z = InX yields the rate equation with 
additive fluctuations £(£) given by Eq. (42). The FPE for the transformed equation 
is Eq. (46) with the solution Eq. (47). In terms of the original variable the pdf is a 
log-normal distribution, for more details see Goel et al. 76 


5.1.2. Stochastic ontogenetic growth model 

A more interesting rate equation was first considered by von Bertalanffy 219 who 
postulated a simple nonlinear rate equation to describe the growth of TBM m of 
the form 

^ = A^rrf 1 - Cpm 13 (49) 

at time t where g and (3 are unspecified exponents, and A^ and Cp are positives 
constants. The solution to this equation was studied by von Bertalanffy 219 with 
r) = 2/3 and (3 = 1. More recently West et al 250 constructed Eq. (49), called an 
ontogenetic growth model (OGM), from a conservation of energy argument and 
obtained a universal curve from the solution and fit it to data by using g = 3/4 
and f3 = 1. In the OGM the parameters are given by Cp = with B m the 

metabolic rate required to maintain an existing unit of biomass; the metabolic 
energy required to create a unit biomass and A v = a/E m . Banavar et al. 10 obtained 
an equivalent universal curve and fit the same data for both r] = 2/3, 3/4 and (3 = 1. 
The OGM equation has the scaling form: 

dm 

— = mV(m/M 0 ), (50) 

where Mq is chosen to be the maximum TBM that solves the stationary equation 
dm/dt = 0. The data from 13 organisms graphed as r = (m/Mo) 1_// versus r = 
— ln(l — r) collapse onto a single universal curve 250 that is fit equally well with 
rj = 2/3 or r/ = 3/4. 10 

West and West 246 generalized the OGM to incorporate the disordering influ¬ 
ence of entropy by including the statistical fluctuations of the measured mass. The 
dynamics of such a stochastic process eventually erases the influence of the initial 
state and this gradual loss of information is a manifestation of increasing entropy. 
They refer to this as the stochastic ontogenetic growth model (SOGM). Consider a 
phenomenological Langevin equation with multiplicative fluctuations for the TBM 
where to avoid confusion we use the dummy variable Z = m, which is actually the 
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TBM for species i and individual j and individual j given by Mij. Thus, we replace 
Eq. (49) for r] = b and /3 = 1 for the SOGM stochastic differential equation: 

d7 

— = A b Z b - C 0 Z + zat) , (51) 

at 

with £(£) given by a delta correlated random process of strength D with normal 
statistics and correlations given by Eq. (43). Rather than assuming the rate of 
creating a unit biomass is a single number C\ they assume it has a stochastic 
component: C\ Co + £(£), with average value Co- 

An equivalent description of the SOGM is given by the corresponding dynamics 
of the pdf in phase space. The FPE corresponding to Eq. (51) is 118,193 : 


dP(z,t) 

dt 


d_ 

dz 



Coz) H- Dz 


d_ 

dz 


PM. 



The general solution to Eq. (52) remains a mystery, but the steady-state solution 
(t oo) denoted by the ss subscript obtained by setting the probability flux to 
zero is given by: 



normalized on the interval ( 0 , oo) with the parameter values 7 = Ab/ftD, fi = 
1 + (Co/D) and /3 = 1 — b. Note that the steady-state solution to the SOGM FPE 
replaces the deterministic steady-state solution obtained from the OGM . 10,250 

West and West 246 fit the parameters in Eq. (53) to a data set of mammalian 
species tabulated by Heusner 91 and this fit is depicted in Fig. 11. They constructed 
a histogram by partitioning the mass axis into intervals of 20 g and counting the 
number of species within each interval. The vertical axis is the relative number of 
species as a function of TBM. The allometry exponet is fixed at b = 3/4 so that 
f3 = 1/4 and the remaining parameters are determined to have the best values of 
7 = 13.4 and p = 2.04 with a quality of fit measured by the correlation coefficient 
r 2 = 0.96. 

Figure 11 shows the fit to the low-mass species out to 500 g. The remaining fit 
out to three million grams is not shown but is in fact the more ineresting part of the 
distribution. To capture this information on a single graph we 246 construct a second 
histogram, this one for the asymptotic region from approximately one thousand to 
three million grams. Figure 12 depicts the fit to the logarithmic histogram data 
points indicated by the dots starting at a TBM of 1.1 kg. An inverse power-law 
distribution would be a straight line with a negative slope on this log-log graph. 
Inserting the parameter values p = 1.67 and 7 — 8.96 into the steady-state TBM 
pdf given by Eq. (53) yields the solid cuve in Fig. 12 which fits the data extremely 
well. The curve is quite clearly an inverse power law in the interspecies TBM. 
This coarse-grained description of the interspecies TBM statistics indicates great 
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Fig. 11. The histogram constructed from the average TBM data for 391 mammalian species 91 
is given by the dots. The mass data has been divided into intervals of 20 g each and the number 
of species within each such interval counted. The horizontal axis is the relative frequency in each 
interval and the solid line segment is the least squares fit of Eq. (53) to the data points. The 
quality of fit is r 2 = 0.96. 246 



z jubh (g) 


Fig. 12. The average TBM data for 391 mammalian species 91 are used to construct a histogram. 
The mass interval is divided into 20 equally spaced intervlas on a logarithm scale and the number 
of species within each interval counted. The least-square fit to the nine data points is then made 
using logrimically transformed distribution, see Ref. 246 for details. The quality of the fit is 
r 2 = 0.998. 

variability particularly since fi < 2 indicating that the variance of the interspecies 
TBM diverges. 


5.1.3. Interspecies AR derivation 

We 24 b243 ex pii c itly constructed a statistical theory of the fluctuations in the mea¬ 
sured species metabolic rate Bi and TBM Mi to calculate the interspecies metabolic 
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AR. The strategy was to related the average BMR to the average TBM through the 
AR. This approach can be followed here from the long-time or steady-state average 
SOGM TBM using the steady-state pdf : 




* oo 


z 1 M exp 


—pz P}dz 


and replacing z with the TBM for species i reduces the average to: 


(Mi) = p/P 






Here we interpret the ensemble average in terms of an average over an ensemble of 
individual members of species i. In this equation we have approximated the ratio 
of gamma functions by an asymptotic value and substituting the parameter values 
into the right-hand side of Eq. (54) yields: 


{Mf) « 





The equality in Eq. (55) is obtained by substituting the values of the parameters 
from OGM and using 6 = 3/4 and [3 = 1/4 to yield 

B m ~ &(AA) -1 / 4 • (56) 


Thus, the average metabolic rate necessary to maintain a unit biomass is dependent 
on the inverse quarter power of the average TBM of the species. This is the same 
expression obtained by Moses et al. 149 if we identify their adult TBM with the 
average TBM obtained from the steady-state pdf of the SOGM. 

Note that the average TBM for species i replaces Mq obtained from the 
dm/dt = 0 solution to Eq. (50). On the other hand the average BMR can be deter¬ 
mined from the 6th moment of the TBM 



Comparing this last equation with Eq. (54) allows us to write: 

(Bt) « aiMp-P = a(Mp , (58) 

which is the interspecies metabolic AR. We emphasize 246 that the interspecies AR 
is a pheomenological equation that without the SOGM does not have a formal 
theoretical basis. 
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The SOGM constitutes the first theoretical construction of an interspecies 
metabolic AR starting from a fundamental dynamic equation and relating the 
proper averages. It is true that we used a nonlinear multiplicative Langevin equa¬ 
tion, but the original dynamics stem from the conservation of energy argument 250 
and the fluctuations are a consequence of the dynamics being degraded by en¬ 
tropy. 243 This suggests a possible untraveled path for future research in this area. 


5.2. Scaling solution for AR 

A phase space equation for anomalous diffusion containing the influence of long¬ 
term memory on the dynamics of the pdf was first discussed by West and Seshadri 227 
in term of the fractional calulus. The resulting fractional diffusion equation 108,236 
(FDE) has subsequently found a variety of uses in the physical sciences. The pdf 
that solves a class of FDEs satisfies the scaling equation 


P(z,t) = 


1 







Note that this scaling of the pdf is also a realization of the RG relation for the 
random variable Z(bt) = b^Z(t). 

The function F z (-) in Eq. (59) is left unspecified but it is analytic in the similarity 
variable z/t^ . As mentioned in the introduction a standard diffusion process Z(t) 
is the displacement of the diffusing particle from its initial position at time £, and 
for vanishing small dissipation the scaling parameter is p z = 1/2 and the functional 
form of F z {f) is a normal distribution. However, for general complex phenomenon 
there is a broad class of distributions for which the functional form of F z (-) is 
not Gaussian and the scaling index p z ^ 1/2. For example, the a-stable Levy 
process 147,184,198,264 scales in this way and the Levy index is in the range 0 < a < 
2, with the equality holding for the Gauss distribution and the scaling index is 
related to the Levy index by p z = 1/cq see West, Geneston and Grigolini 239 and 
Uchaikin 216 for very different discussions of this scaling. 

The Gibbs entropy (Shannon information), 23,197 can be defined using the pdf 
for the variable Z(t) Eq. (59) to obtain: 



P(z, t) In P(z, t)dz = 5° + fi z In t , 



so the entropy deviation from the reference state S' 0 = — f F z (q) In F z (q)dq inte¬ 
grated with respect to the scaled variable q = z/t^ x , increases logarithmically in 
the independent variable t. 

Huxley 98 assumed the differential growth equations in an organism to have 
the same form with proportional growth rates. West and West 241,243 adapted this 
assumption and presumed the two parts of an organism share the same class of 
probability densities that describe their interacting observables. Here t is clock time, 
the independent variable, and the network measures are the dependent variables, 
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the average of a generic scaling observable using the pdf given by Eq. (59) yields: 


(m) 


zP(z, t)dz = F z 


qF z (q)dq = F z Z . 



Note that Z = JqF z (q)dq is a finite, time-independent constant. Comparing 
Eqs. (61) and (60) enables us to write for the change in entropy relative to a 
reference state: 


AS* = In [(Z(t))/Z\ (62) 

so that the change in the average value of an observable is determined by the change 
in entropy 

(Z(t)) = Ze ASAt) . (63) 

Consequently Z may be identified with either of the network variables to obtain the 
relation between entropies from Eq. (60) 

(i X XSy / fdy 

and using Eq. (63) we can write: 

(Yi) = ?e AS « = YeT% AS * = a{Xi) b (64) 

with the allometry coefficient given by a = Y/X b and the allometry exponent by 
b = jiyjj± x . We have again introduced the index for the single species i to emphsize 
that this is the interspecies AR. This derivation of the empirical interspecies AR is 
solely a consequence of the scaling properties of the pdf 1 s. 

The rate of entropy generation is determined by the time derivative of Eq. (63) 
for X and Y and substituting from the entropy balance equation yields: 

1 d{)Q 1 d(Yj) 

Vx(Xi) dt fi y (Y) dt ' 1 j 

Equation (65) shows the correspondence of the probabilistic approach to AR with 
that of Huxley, with the dynamic variables of individual species members replaced 
by averages over an ensemble of individuals. In the present case the allometry ex¬ 
ponent is now the ratio of the power-law indices in the pdf 1 s, which from Eq. (65) 
can also be interpreted as the ratio of growth rates. Moreover, the allometry coeffi¬ 
cient is determined by the scaling functions in the pdf 1 s. It is the average response 
or adaptation of the subnetwork to the average behavior of the host network that 
is captured by the interspecies AR through the balance of the entropy generated. 
Moreover it is the scaling in the pdf and not necessarily geometric scaling of fractal 
networks that is the origin of the physiologic AR. Underlying this derivation is the 
assumption made earlier that this is an information-dominated network and is con¬ 
sequently driven by information (entropy) gradients and not the energy gradient 
necessary in the fractal geometry derivation of nutrient transport. 
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6. Conclusion 

The statistical analysis of the metabolic data presented herein show that empir¬ 
ical ARs exist across multiple species and consequently the form of ARs are not 
solely dependent on species specific mechanisms. Moreover we have shown there is 
no universal value for the allometry exponent for three reasons. First there is the 
variability in the values of the allometry exponent obtained from data analysis in 
Sec. 3. We could rehash the arguments here, but an objective observer would have 
to conclude that although there are indications that a particular value might be 
strongly indicated in a specific context, the evidence for universality is not com¬ 
pelling. Even in the case of metabolic ARs one could reasonably make a case of 
b = 2/3 for small animals, b = 3/4 for large animal and 6=1 for plants, but no 
one value of b that spans the total range of animal and plant sizes. Second, the 
only theories that predict a universal value of the allometry exponent do so to ex¬ 
plain the intraspecies AR and not the interspecies AR. There is no first principles 
theory that derives the empirical AR between the averaged observables. The phe¬ 
nomenological theory presented in Section 4 successfully derives the interspecies 
AR and treats the allometry coefficient and allometry exponent as empirical pa¬ 
rameters determined by the parameters in the pd/’s. Third and last the allometry 
exponent and coefficient are determined to co-vary using theory 243 and statistical 
data analysis. 72 

However let us not be too critical of phenomenological theories. Recall that 
thermodynamics is a phenomenological theory that has enjoyed remarkable suc¬ 
cess in explaining a plethora of complex physical phenomena. Even so the physics 
community has not yet been able to derive thermodynamics from the more reduc¬ 
tionist (fundamental) statistical mechanics, so that a mechanistic understanding 
of thermodynamics remains controversial. By the same token the phenomenolog¬ 
ical theory of AR presented herein takes the discussion of AR out of the domain 
of the reductionist approaches previously used to derive the intraspecies ARs and 
refocuses it on the statistical properties of the empirical interspecies ARs. These 
analyses indicate a new avenue for the study of physiologic ARs, one that system¬ 
atically includes both deterministic and statistical aspects using the probability 
calculus. This new perspective indicates that the origins of physiologic AR reside 
in the scaling properties of the pdf 1 s for nested complex networks. 

In simple physical phenomena the dynamics of a network coupled to the en¬ 
vironment is described by a Langevin equation, which is a stochastic differential 
equation. This mechanical description of the dynamics is equivalent to the phase 
space dynamics of the pdf such as given by the FPE. As the network becomes more 
complex its past history becomes more and more influential until finally the FPE 
must be replaced with a FDE and the dynamics are the domain of the fractional 
calculus. As we saw in Sec. 5 the general solution to a class of FDE’s is a scaling pdf 
implying that ARs can result from scaling in the statistical dynamics of the phe¬ 
nomenon. The influence of the history of the growth of an organism on the growth 
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of an organ leads to intraspecies AR; the history of the size of an organism on the 
metabolic rate of the organism across species leads to interspecies AR. 

The arguments in Sec. 5 imply that empirical ARs are a consequence of infor¬ 
mation transfer between complex information-dominated networks. Of course we 
have not rigorously proven that the entropy decrease (information increase) in the 
host network and the increase in entropy (information decrease) in the subnetwork 
have the relations assumed, this remains to be done. However the empirical rela¬ 
tions are consistent with the Principle of Complexity Management in which the 
transfer of information between two complex information-dominated networks has 
been shown 7,239 to proceed from the network with the greater information content 
to the network with the lesser. 

The phenomenological theory of empirical AR presented herein incorporates 
both stochastic and reductionistic mechanisms. From this approach we conclude 
that the empirical AR for the BMR, the life time of an organism and the myriad 
of other complex phenomena are not completely explained by reductionistic mech¬ 
anisms. In particular the allometry parameters are determined not to be universal. 
This was supported by the demonstration that the allometry exponent and coeffi¬ 
cient co-vary. On the other hand, the laws from the theory of probability can have 
universal forms without having universal parameters, for example the Law of Fre¬ 
quency of Errors and the Law of Large Numbers. The empirical AR is in part a 
consequence of the generic statistical behavior of complex networks that depend on 
the infinitely divisible nature of the pdf. 74,75 The phenomenological theory of AR 
presented herein is the application of the probability calculus to the understanding 
of the origins of the empirical relations between averages of different parts of a com¬ 
plex network. Consequently empirical AR is a manifestation of a law whose origin 
can be traced back to the probability calculus and the balancing of deterministic 
and stochastic mechanisms. 
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Abstract —The problem of distributed learning and channel ac¬ 
cess is considered in a cognitive network with multiple secondary 
users. The availability statistics of the channels are initially 
unknown to the secondary users and are estimated using sensing 
decisions. There is no explicit information exchange or prior 
agreement among the secondary users and sensing and access 
decisions are undertaken by them in a completely distributed 
manner. We propose policies for distributed learning and ac¬ 
cess which achieve order-optimal cognitive system throughput 
(number of successful secondary transmissions) under self play, 
i.e., when implemented at all the secondary users. Equivalently, 
our policies minimize the sum regret in distributed learning and 
access, which is the loss in secondary throughput due to learning 
and distributed access. For the scenario when the number of 
secondary users is known to the policy, we prove that the total 
regret is logarithmic in the number of transmission slots. This 
policy achieves order-optimal regret based on a logarithmic lower 
bound for regret under any uniformly-good learning and access 
policy. We then consider the case when the number of secondary 
users is fixed but unknown, and is estimated at each user through 
feedback. We propose a policy whose sum regret grows only 
slightly faster than logarithmic in the number of transmission 
slots. 

Index Terms—Cognitive medium access control, multi-armed 
bandits, distributed algorithms, logarithmic regret. 


I. Introduction 

T HERE has been extensive research on cognitive radio 
network in the past decade to resolve many challenges not 
encountered previously in traditional communication networks 
(see e.g., [2]). One of the main challenges is to achieve 
coexistence of heterogeneous users accessing the same part of 
the spectrum. In a hierarchical cognitive network, there are two 
classes of transmitting users, viz., the primary users who have 
priority in accessing the spectrum and the secondary users who 
opportunistically transmit when the primary user is idle. The 
secondary users are cognitive and can sense the spectrum to 
detect the presence of a primary transmission. However, due 
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to resource and hardware constraints, they can sense only a 
part of the spectrum at any given time. 

We consider a slotted cognitive system where each sec¬ 
ondary user can sense and access only one orthogonal channel 
in each transmission slot (see Fig. 1). Under sensing con¬ 
straints, it is thus beneficial for the secondary users to select 
channels with higher mean availability, i.e., channels which 
are less likely to be occupied by the primary users. However, 
in practice, the channel availability statistics are unknown to 
the secondary users at the start of the transmissions. 

Since the secondary users are required to sense the medium 
before transmission, can these sensing decisions be used to 
learn the channel availability statistics? If so, using these 
estimated channel availabilities, can we design channel access 
rules which maximize the transmission throughput? Designing 
provably efficient algorithms to accomplish the above goals 
forms the focus of our paper. Such algorithms need to be 
efficient, both in terms of learning and channel access. 

For any learning algorithm, there are two important perfor¬ 
mance criteria: consistency and regret bounds [3]. A learning 
algorithm is said to be consistent if the learnt estimates 
converge to the true values as the number of samples goes 
to infinity. The regret 1 of a learning algorithm is a measure 
of the speed of convergence, and is thus meaningful only for 
consistent algorithms. In our context, the users need to learn 
the channel availability statistics consistently in a distributed 
manner. The regret in this case is defined as the loss in 
secondary throughput due to learning when compared to the 
ideal scenario where the channel statistics are known perfectly 
(see (2)). It is thus desirable to design distributed learning 
algorithms with small regret. 

Additionally, we consider a distributed framework where 
there is no information exchange or prior agreement among 
the secondary users. This introduces additional challenges: 
it results in loss of throughput due to collisions among the 
secondary users, and there is now competition among the 
secondary users since they all tend to access channels with 
higher availabilities. It is imperative that the channel access 
policies overcome the above challenges. Hence, a distributed 
learning and access policy experiences regret both due to 
learning the unknown channel availabilities as well as due 
to collisions under distributed access. 

1 Note that the sum regret is a finer measure of performance of a 
distributed algorithm than the time-averaged total throughput of the users 
since any sub-linear regret (with respect to time) implies optimal average 
throughput. On the other hand, regret is equivalent to total throughput and 
hence, optimal regret is equivalent to achieving optimal total throughput. 
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Fig. 1. Cognitive radio network with U = 4 secondary users and C = 5 
channels. A secondary user is not allowed to transmit if the accessed channel 
is occupied by a primary user. If more than one secondary user transmits in 
the same free channel, then all the transmissions are unsuccessful. 


A. Our Contributions 

The main contributions of this paper are two fold. First, 
we propose two distributed learning and access policies under 
multiple secondary users in a cognitive network. Second, we 
provide performance guarantees for these policies in terms of 
regret. Overall, we prove that one of our proposed algorithms 
achieves order-optimal regret and the other achieves nearly 
order-optimal regret, where the order is with respect to the 
number of transmission slots. 

The first policy we propose assumes that the total number 
of secondary users in the system is known while our second 
policy removes this requirement. We provide bounds on the 
total regret experienced by the secondary users under self 
play, i.e., when implemented at all the secondary users. For 
the first policy, we prove that the regret is logarithmic, i.e., 
0(log n) where n in the number of transmission slots. For the 
second policy, the regret grows slightly faster than logarithmic, 
i.e., 0(f(n) logn), where we can choose any function f(n) 
satisfying f(n) —» oo, as n —» oo. Hence, we provide 
performance guarantees for the proposed distributed learning 
and access policies. 

A lower bound on regret under any uniformly-good dis¬ 
tributed learning policy has been derived in [4], which is also 
logarithmic in the number of transmission slots (See (6) for 
definition of uniformly-good policies). Thus, our first policy 
(which requires knowledge of the number of secondary users) 
achieves order-optimal regret. The effects of the number of 
secondary users and the number of channels on regret are 
also explicitly characterized and verified via simulations. 

The exploration-exploitation tradeoff for learning, com¬ 
bined with the cooperation-competition tradeoffs among mul¬ 
tiple users for distributed medium access have not been 
sufficiently examined in the literature (see Section I-B for a 
discussion). Our analysis in this paper provides important en¬ 
gineering insights towards dealing with learning, competition, 
and cooperation in practical cognitive systems. 


Remark on Model Assumptions: We note some of the 
limitations of our approach. We prove order-optimal regret for 
our proposed algorithms under an i.i.d. primary transmission 
model 2 , where the probability of a channel being occupied 
by any primary user in any slot is assumed to be i.i.d. 
This is indeed is idealistic and in practice, a Markovian 
model may be more appropriate [5], [6]. The i.i.d. model 
leads to simple learning schemes for channel availabilities 
but is not crucial towards deriving regret bounds for our 
proposed schemes. Extensions of the classical multi-armed 
bandit problem to a Markovian model are considered in [7]. 
In principle, our results on achieving order-optimal regret for 
distributed learning and access can be similarly extended to 
a Markovian channel model but this entails more complex 
estimators and rules for evaluating the exploration-exploitation 
tradeoffs of different channels and is a topic of interest for 
future investigation. 


B. Related Work 

Several results on the multi-armed bandit problem will be 
used and generalized to study our problem. Detailed discussion 
on multi-armed bandits can be found in [8]-[ll]. Cognitive 
medium access is a topic of extensive research; see [12] 
for an overview. The connection between cognitive medium 
access and the multi-armed bandit problem is explored in 
[13], where a restless bandit formulation is employed. Under 
this formulation, indexability is established, Whittle’s index 
for channel selection is obtained in closed-form, and the 
equivalence between the myopic policy and the Whittle’s 
index is established. However, this work assumes known 
channel availability statistics and does not consider competing 
secondary users. The work in [14] considers allocation of two 
users to two channels under a Markovian channel model using 
a partially observable Markov decision process (POMDP) 
framework. The use of collision feedback information for 
learning, and spatial heterogeneity in spectrum opportunities 
were investigated. However, the difference from our work 
is that [14] assumes that the availability statistics (transition 
probabilities) of the channels are known to the secondary users 
while we consider learning of unknown channel statistics. The 
works in [15], [16] consider centralized access schemes in 
contrast to distributed access here, the work in [17] considers 
access through information exchange and studies the optimal 
choice of the amount of information to be exchanged given 
the cost of negotiation. The work in [18] considers access 
under Q -learning for two users and two channels where users 
can sense both the channels simultaneously. The work in 
[19] discusses a game-theoretic approach to cognitive medium 
access. In [20], learning in congestion games through multi¬ 
plicative updates is considered and convergence to weakly - 
stable equilibria (which reduces to the pure Nash equilibrium 
for almost all games) is proven. However, the work assumes 
fixed costs (or equivalently rewards) in contrast to random 
rewards here, and that the players can fully observe the actions 
of other players. 

2 By i.i.d. primary transmission model, we do not mean the presence of 
a single primary user, but rather, this model is used to capture the overall 
statistical behavior of all the primary users in the system. 
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Recently, the work in [21] considers combinatorial bandits, 
where a more general model of different (unknown) channel 
availabilities is assumed for different secondary users, and a 
matching algorithm is proposed for jointly allocating users 
to channels. The algorithm is guaranteed to have logarithmic 
regret with respect to the number of transmission slots and 
polynomial storage requirements. A decentralized implemen¬ 
tation of the proposed algorithm is proposed but it still 
requires information exchange and coordination among the 
users. In contrast, we propose algorithms which removes this 
requirement albeit in a more restrictive setting of identical 
channel availabilities for all users. 

Recently, Liu and Zhao [4] proposed a family of distributed 
learning and access policies known as time-division fair share 
(TDFS), and proved logarithmic regret for these policies. They 
established a lower bound on the growth rate of system regret 
for a general class of uniformly-good decentralized polices. 
The TDFS policies in [4] can incorporate any order-optimal 
single-player policy while our work here is based on the 
single-user policy proposed in [11]. Another difference is 
that in [4], the users orthogonalize via settling at different 
offsets in their time-sharing schedule, while in our work here, 
users orthogonalize into different channels. Moreover, the 
TDFS policies ensure that each player achieves the same time- 
average reward while our policies here achieve probabilistic 
fairness, in the sense that the policies do not discriminate be¬ 
tween different users. In [22], the TDFS policies are extended 
to incorporate imperfect sensing. 

In our prior work [1], we first formulated the problem 
of decentralized learning and access for multiple secondary 
users. We considered two scenarios: one where there is initial 
common information among the secondary users in the form of 
pre-allocated ranks, and the other where no such information 
is available. In this paper, we analyze the distributed policy in 
detail and prove that it has logarithmic regret. In addition, we 
also consider the case when the number of secondary users is 
unknown, and provide bounds on regret in this scenario. 

Organization: Section II deals with the system model, 
Section III deals with the special case of single secondary 
user and of multiple users with centralized access which 
can be directly solved using the classical results on multi¬ 
armed bandits. In Section IV, we propose distributed learning 
and access policy with provably logarithmic regret when the 
number of secondary users is known. Section V considers the 
scenario when the number of secondary users is unknown. 
Section VI provides a lower bound for distributed learning. 
Section VII includes some simulation results for the proposed 
schemes and Section VIII concludes the paper. Most of the 
proofs are found in the Appendix. 

Since Section III mostly deals with a recap of the clas¬ 
sical results on multi-armed bandits, those familiar with the 
literature (e.g., [8]—[ 11]) may skip this section without loss of 
continuity. 


II. System Model & Formulation 

Notation: For any two functions /(n),g(n), f{n) = 
0(g(n)) if there exists a constant c such that f{n) < cg(n ) 
for all n > no for a fixed no G N. Similarly, /(n) = Q(g(n)) 
if there exists a constant c! such that /(n) > dg[n ) for all 


n > no for a fixed no G N, and /(n) = 0(g(n)) if /(n) = 
f 2(g(n)) and f(n) = 0(g(n)). Also, /(n) = o(g(n)) when 
f(n)/g(n) —» 0 and /(n) = uj(g(n)) when f(n)/g(n) —» oo 


as n —» oo. 

For a vector /x, let \/x\ denote its cardinality, and let 

IT 


/a = [/ii, fi 2 5 , P\/x\} 1 • We refer to the U highest entries 

in a vector jx as the U -best channels and the rest as the U- 
worst channels, where 1 < U < \/x\. Let cr(a; /x) denote 
the index of the a th highest entry in /x . Alternatively, we 
abbreviate a*:=cr(a; /x ) for ease of notation. Let B(/x) denote 
the Bernoulli distribution with mean /x. With abuse of notation, 
let D(/xi,/X 2 ):=D(B(/ii); B(fi 2 )) be the Kullback-Leibler 
distance between the Bernoulli distributions B(/x i) and B(/x 2 ) 
[23] and let A(l, 2):=/xi — /x 2 . 


A. Sensing & Channel Models 

Let U > 1 be the number of secondary users 3 and C > U 
be the number 4 of orthogonal channels available for slotted 
transmissions with a fixed slot width. In each channel i and 
slot k, the primary user transmits i.i.d. with probability 1 — 
fii > 0. In other words, let Wi(k) denote the indicator variable 
if the channel is free 


Wi(k) 


0, channel i occupied in slot k 

1, o.w, 


2 ^ ^ 

and we assume that Wi(k ) " B(k). 

The mean availability vector /x consists of mean availabil¬ 
ities fii of all channels, i.e., is /x:=[/xi ,..., /xc), where all 
\X{ G (0,1) and are distinct, /x is initially unknown to all the 
secondary users and is learnt independently over time using 
the past sensing decisions without any information exchange 
among the users. We assume that channel sensing is perfect 
at all the users. 

Let Tij(k) denote the number of slots where channel i is 
sensed in k slots by user j (not necessarily being the sole 
user to sense that channel). Note that Yli=i Tij(k) = k for 
all users j , since each user senses exactly one channel in every 
time slot. The sensing variables are obtained as follows: at the 
beginning of the k th slot, each secondary user j G U selects 
exactly one channel i G C for sensing, and hence, obtains the 
value of Wi(k ), indicating if the channel is free. User j then 
records all the sensing decisions of each channel i in a vector 
XU=A;,i(l ),...,X ZJ (T i , j (k))] T . Hence, uf =1 X^. is the 
collection of sensed decisions for user j in k slots for all the 
C channels. 

We assume the collision model under which if two or 
more users transmit in the same channel then none of the 
transmissions go through. At the end of each slot k, each 
user j receives acknowledgement Zj(k ) on whether its trans¬ 
mission in the k th slot was received. Hence, in general, any 
policy employed by user j in the (k + l) th slot, denoted by 
p( U^XjLjZ^), is based on all the previous sensing and 
feedback results. 


3 A user refers to a secondary user unless otherwise mentioned. 

4 When U > C, learning availability statistics is less crucial, since all 
channels need to be accessed to avoid collisions. In this case, design of 
medium access is more crucial. 
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Algorithm 1 Single User Policy p 1 ( g(n)) in [10]. 

Input: : Sample-mean availabilities after n 

rounds, g(i\n ): statistic based on Xij(n) 9 
a (a; g (n)): index of a th highest entry in g (n). 

Init: Sense in each channel once, n <— C, Curr_Sel <— C. 
Loop: n n + 1 

Curr_Sel channel corresponding to highest entry in g (n) 
for sensing. If free, transmit. 


B. Regret of a Policy 

Under the above model, we are interested in designing 
policies p which maximize the expected number of successful 
transmissions of the secondary users subject to the non¬ 
interference constraint for the primary users. Let S(n; pi, U, p) 
be the expected total number of successful transmissions after 
n slots under U number of secondary users and policy p. 

In the ideal scenario where the availability statistics pi are 
known a priori and a central agent orthogonally allocates the 
secondary users to the U -best channels, the expected number 
of successful transmissions after n slots is given by 

u 

S*(n;n, U):=n'y2v(j*); (!) 

3 = 1 3 * 

where j* is the j th -highest entry in pi. 

It is clear that S*(n; H,U) > S(n ; pi , U, p) for any policy 
p and any finite n £ N. We are interested in minimizing the 
regret in learning and access, given by 

R{n ; pi, U, p):=S*(n ; pi, U) — S(n ; pi, U, p) > 0. (2) 


We are interested in minimizing regret under any given pi £ 
(0, l) c with distinct elements. 

By incorporating the collision channel model assumption 
with no avoidance mechanisms 5 , the expected throughput 
under the policy p is given by 

c u 

S(n;n,U,p) = 

i =1 3 = 1 


where Vij(n) is the number of times in n slots where user 
j is the sole user to sense channel i. Hence, the regret in (2) 
simplifies as 


u c u 

R(n; p) = y]n/x(F) - E[Vij(n) 

k —1 i —1 j —1 



multi-armed bandit process. Lai and Robbins [8] first analyzed 
schemes for multi-armed bandits with asymptotic logarithmic 
regret based on the upper confidence bounds on the unknown 
channel availabilities. Since then, simpler schemes have been 
proposed in [10], [11] which compute a statistic or an index for 
each arm (channel), henceforth referred to as the g-statistic, 
based only on its sample mean and the number of slots where 
the particular arm is sensed. The arm with the highest index is 
selected in each slot in these works. We summarize the policy 
in Algorithm 1 and denote it p 1 (g(n)), where g(n) is the 
vector of scores assigned to the channels after n transmission 
slots. 

The sample-mean based policy in [11, Thm. 1] proposes an 
index for each channel i and user j at time n is given by 


MEAN 

yj 






where T^j(n) is the number of slots where user j selects 
channel i for sensing, 



T, j ('ft-) 

(T id (n)):= Y, 

k =1 


XiAk) 


is the sample-mean availability of channel i, as sensed by user 

j- 

The statistic in (4) captures the exploration-exploitation 
tradeoff between sensing the channel with the best predicted 
availability to maximize immediate throughput and sensing 
different channels to obtain improved estimates of their avail¬ 
abilities. The sample-mean term in (4) corresponds to exploita¬ 
tion while the other term involving Tij(n) corresponds to 
exploration since it penalizes channels which are not sensed 
often. The normalization of the exploration term with logn 
in (4) implies that the term is significant when Tij(n) is 
much smaller than log n. On the other hand, if all the channels 
are sensed ©(logn) number of times, the exploration terms 
become unimportant in the ^-statistics of the channels and the 
exploitation term dominates, thereby, favoring sensing of the 
channel with the highest sample mean. 

The regret based on the above statistic in (4) is logarithmic 
for any finite number of slots n but does not have the optimal 
scaling constant. The sample-mean based statistic in [10, 
Example 5.7] leads to the optimal scaling constant for regret 
and is given by 


III. Special Cases From Known Results 

We recap the bounds for the regret under the special cases 
of a single secondary user (U = 1) and multiple users with 
centralized learning and access by appealing to the classical 
results on the multi-armed bandit process [8]-[10]. 

A. Single Secondary User (U = 1) 

When there is only one secondary user, the problem of 
finding policies with minimum regret reduces to that of a 

5 If the users employ CSMA-CA to avoid collisions then the regret is 
reduced. The bounds derived in this paper are applicable for this case as 
well. 



= x i,j( T ij(. n )) + min 



logn 
2 Tij(n) 


1 



In this paper, we design policies based on the g MEAN statistic 
since it is simpler to analyze than the g OPT statistic. This is 
because g MEAN is a continuous function in Tij(n) while g OPT 
has a threshold on T^y(n). 

We now recap the results which show logarithmic regret in 
learning the best channel. In this context, we define uniformly 
good policies p [8] as those with regret 

i^(n;/x, U, p) = o(n a ), \/a > 0, pi £ (0, l) c . (6) 
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Theorem 1 (Logarithmic Regret for U = 1 [10], [11]): 

For any uniformly good policy p satisfying (6), the expected 
time spent in any suboptimal channel i ^ 1* satisfies 


lim P 

n—> oo 


Ti,i(n) > 


(1 — e) logn 

i*) 




where 1* is the channel with the best availability. Hence, the 
regret satisfies 


ItoinfS^iLM > V A(1 ’’ i) , . 

oo logn D(LLi,LL 1 * ) 

iG 1-worst ’ 


( 8 ) 


The regret under the g OPT statistic in (5) achieves the above 
bound. 


lull - 


n—> oo 


logn 


E 


iG 1-worst 

The regret under g MEAN statistic in (34) satisfies 

8 logn 


A(iy) 


(9) 






+ 1 + 


7T 


Algorithm 2 Centralized Learning Policy p CENT in [9]. 

Input: X n := U7 =1 : Channel availability after n 

slots, g(n): statistic based on A n , 

cr(a; g(n)): index of a th highest entry in g(n). 

Init: Sense in each channel once, n <— C 
Loop: n <— n + 1 

Curr_Sel channels with /7-best entries in g(n). If free, 
transmit. 


Notice that for both the single user case U = 1 (Theorem 1) 
and the centralized multi-user case (Theorem 2), the number 
of time slots spent in the U-worst channels is ©(logn) and 
hence, the regret is also ©(logn). 


IV. Main Results 

Armed with the classical results on multi-armed bandits, we 
now design distributed learning and allocation policies. 


B. Centralized Learning & Access for Multiple Users 

We now consider multiple secondary users under centralized 
access policies where there is joint learning and access by a 
central agent on behalf of all the U users. Here, to minimize 
the sum regret, the centralized policy allocates the U users to 
orthogonal channels to avoid collisions. Let p CEm (X k ), with 
X k := Uf =1 XE, denote a centralized policy based 
on the sensing variables of all the users. The policy under 
centralized learning is a simple generalization of the single- 
user policy and is given in Algorithm 2. We now recap the 
results of [9]. 

Theorem 2 (Regret Under Centralized Policy p CENT [9]): 

For any uniformly good centralized policy p CEm satisfying 
(6), the expected times spent in a U -worst channel i satisfies 


lim P 

n—^ oo 


U 


(1 — e) logn 


E rr ( X ^ v- 1 - - ^ iU 6 n 
j=1 L)\pi, pjj* ) 



( 10 ) 


where U* is the channel with the /7 th best availability. Hence, 
the regret satisfies 


lim inf 

n—> oo 


R(n ; /x, 1, p CENT ) 


logn 


> 


E 

iG U-worst 


A (U*,i) 

D (pi i pu* ) 


(ID 


The scheme in Algorithm 2 based on g OPT achieves the above 
bound. 


lim 

n—^ oo 


it^n; /x, 1, p CENT (g 0PT ) 

logn 


E 


iG U-worst 


A(£/V) 
D(pi, pu*) 


( 12 ) 


The scheme in Algorithm 2 based on the g MEAN satisfies for 
any n > 0, 


R(n; p,U, p CENT ( g MEAN )) 

u u A / * -\ 

< S S ^ ^ ^ 

rri =1 2 G U-worst k =1 ^ 


8 log n 7 r 2 

---h 1 H- 

A (m*,z) 2 3 

(13) 


Proof: See Appendix A. 


□ 


A. Preliminaries: Bounds on Regret 

We first provide simple bounds on the regret in (3) for any 
distributed learning and access policy p. 

Proposition 1 (Lower and Upper Bounds on Regret): The 
regret under any distributed policy p satisfies 


u 

R(n; p) > E E A(?7*,i)E[7U(n); 

j —1 iG U-worst 
U 

]T E E PU( n )] +E[M(n)] 

j=1 iG U-worst 




(14) 

(15) 


where T^j(n) is the number of slots where user j selects 
channel i for sensing, M (n) is the number of collisions faced 
by the users in the /7-best channels in n slots, A (i,j) = 
p(i) — p(j) and p{ 1*) is the highest mean availability. 

Proof: See Appendix B. □ 

In the subsequent sections, we propose distributed learning 
and access policies and provide regret guarantees for the 
policies using the upper bound in (15). The lower bound in 
(14) can be used to derive lower bound on regret for any 
uniformly-good policy. 

The first term in (15) represents the lost transmission 
opportunities due to selection of U -worst channels (with lower 
mean availabilities), while the second term represents the 
performance loss due to collisions among the users in the 
/7-best channels. The first term in (15) decouples among the 
different users and can be analyzed solely through the marginal 
distributions of the ^-statistics at the users. This in turn, can 
be analyzed by manipulating the classical results on multi¬ 
armed bandits [10], [11]. On the other hand, the second term 
in (15), involving collisions in the /7-best channels, requires 
the joint distribution of the ^-statistics at different users which 
are correlated variables. This is intractable to analyze directly 
and we develop techniques to bound this term. 
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Algorithm 3 Policy p RAND ([/, C, gj(n)) for each user j under 
U users, C channels and statistic g j(n). 

Input: • Sample-mean availabilities at 

user j after n rounds, gj(i\n ): statistic based on Xij(n), 
cr(a;gj(n)): index of a th highest entry in g j{n). 

( j(i ; n): indicator of collision at n th slot at channel i 
Init: Sense in each channel once, n C, Curr_Rank 1, 

Loop: n n + 1 

if (j(Curr_Sek 1 n — 1) = 1 then 

Draw a new Curr_Rank ~ Unif(/7) 

end if 

Select channel for sensing. If free, transmit. 

Curr_Sel <— cr(Curr_Rank ; g j (n)). 

If collision ( j(Curr Sel;m ) <— 1, Else 0. 


5. p RAND : Distributed Learning and Access 

We present the p RAND policy in Algorithm 3. Before de¬ 
scribing this policy, we make some simple observations. If 
each user implemented the single-user policy in Algorithm 1, 
then it would result in collisions, since all the users target 
the best channel. When there are multiple users and there 
is no direct communication among them, the users need to 
randomize channel access in order to avoid collisions. At 
the same time, accessing the /7-worst channels needs to be 
avoided since they contribute to regret. Hence, users can avoid 
collisions by randomizing access over the /7-best channels, 
based on their estimates of the channel ranks. However, if the 
users randomize in every slot, there is a finite probability of 
collisions in every slot and this results in a linear growth of 
regret with the number of time slots. Hence, the users need 
to converge to a collision-free configuration to ensure that the 
regret is logarithmic. 

In Algorithm 3, there is adaptive randomization based 
on feedback regarding the previous transmission. Each user 
randomizes only if there is a collision in the previous slot; 
otherwise, the previously generated random rank for the user 
is retained. The estimation for the channel ranks is through 
the ^-statistic, along the lines of the single-user case. 


C. Regret Bounds under p RAND 


j = 1,..., U, in any i <E U -worst channel is given by 


u 


8 logn 


E[T*»] <E[ AM * )2 +1+ 3 

k —1 


(16) 


Proof: The proof is similar along the lines of Theorem 2, 

given in Appendix A. The only difference is that here, we 
need to consider the probability of transmission of each user 
in each U -worst channel while in Appendix A, the probability 
of transmission by all users is considered. □ 

We now focus on analyzing the number of collisions M(n ) 
in the U -best channels. We first give a result on the expected 
number of collisions in the ideal scenario where each user has 
perfect knowledge of the channel availability statistics p. In 
this case, the users attempt to reach an orthogonal (collision- 
free) configuration by uniformly randomizing over the U -best 
channels. 

The stochastic process in this case is a finite-state Markov 
chain. A state in this Markov chain corresponds to a con¬ 
figuration of U number of (identical) users in U number of 
channels. The number of states in the Markov chain is the 
number of compositions of Z7, given by ( 2U I f 1 ) [24, Thm. 5.1]. 
The orthogonal configuration corresponds to the absorbing 
state. There are two other classes of states in the Markov 
chain. One class consists of states in which each channel either 
has more than one user or no user at all. The second class 
consists of the remaining states where some channels may 
have just one user. For the first class, the transition probability 
to any state of the Markov chain (including self transition and 
absorption probabilities) is uniform. For a state in the second 
class, where certain channels have exactly one user, there are 
only transitions to states which have identical configuration of 
the single users and the transition probabilities are uniform. 
Fet Y(Z7, Z7) denote the maximum time to absorption in the 
above Markov chain starting from any initial distribution. We 
have the following result 

Lemma 2 (No. of Collisions Under Perfect Knowledge): 
The expected number of collisions under p RAND scheme in 
Algorithm 3, assuming that each user has perfect knowledge 
of the mean channel availabilities /x, is given by 


E[M(n); p RAND (Z7, (7, pi)] < Z7E[T(Z7, Z7)] 

2/7 - 1' 


< U 


u 


-1 


(17) 


It is easy to see that the p RAND policy ensures that the 
users are allocated orthogonally to the /7-best channels as 
the number of transmission slots goes to infinity. The regret 
bounds on p RAND are however not immediately clear and we 
provide guarantees below. 

We first provide a logarithmic upper bound 6 on the number 
of slots spent by each user in any Z7-worst channel. Hence, the 
first term in the bound on regret in (15) is also logarithmic. 

Lemma 1 (Time Spent in U-worst Channels): Under the 
p RAND scheme in Algorithm 3, the total time spent by any user 

6 Note that the bound on E [Tij(n)\ in (16) holds for user j even if the 
other users are using a policy other than p RAND . But on the other hand, to 
analyze the number of collisions E [M(nj] in (19), we need every user to 
implement p RAND . 


Proof: See Appendix C. □ 

The above result states that there is at most a finite number 
of expected collisions, bounded by Z7E[Y(Z7, U)\ under perfect 
knowledge of fi. In contrast, recall from the previous section, 
that there are no collisions under perfect knowledge of p 
in the presence of pre-allocated ranks. Hence, Z7E[Y(Z7, Z7)] 
represents a bound on the additional regret due to the lack 
of direct communication among the users to negotiate their 
ranks. 

We use the result of Femma 2 for analyzing the num¬ 
ber of collisions under distributed learning of the unknown 
availabilities fi as follows: if we show that the users are 
able to learn the correct order of the different channels with 
only logarithmic regret then only an additional finite expected 
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number of collisions occur before reaching an orthogonal 
configuration. 

Define T'(n ; p RAND ) as the number of slots where any one 
of the top U -estimated ranks of the channels at some user is 
wrong under the p RAND policy. Below we prove that its expected 
value is logarithmic in the number of transmissions. 

Lemma 3 (Wrong Order of g-statistics): Under the p RAND 
scheme in Algorithm 3, 


E [T\n; p 


RAND' 


u c 

)]^E E 

CL —1 b — CL ~|-1 


8 logn 
A(a*, 6*) 2 


+ 1 + 


7T 


(18) 


Proof: See Appendix D. □ 

We now provide an upper bound on the number of collisions 
M(n) in the U -best channels by incorporating the above result 
on E [T'(n)\, the result on the average number of slots E [Tij 
spent in the i E U -worst channels in Lemma 1 and the average 
number of collisions /7E[Y(/7, U)\ under perfect knowledge 
of pi in Lemma 2. 

Theorem 3 (Logarithmic Number of Collisions Under p RAND ): 
The expected number of collisions in the /7-best channels 
under p RAND (/7, (7, g MEAN ) scheme satisfies 


E [M(n)\ < U (E[Y(/7, U)] + l)E[T'(n) 


(19) 


Hence, from (16), (18) and (17), M(n ) = O(logn). 

Proof: See Appendix E. □ 

Hence, there are only a logarithmic number of expected 
collisions before the users settle in the orthogonal channels. 
Combining this result with Lemma 1 that the number of 
slots spent in the /7-worst channels is also logarithmic, we 
immediately have one of the main results of this paper 
that the sum regret under distributed learning and access is 
logarithmic. 

Theorem 4 (Logarithmic Regret Under p RAND ): The policy 
p RAND (C/, (7, gMEAN) in Algorithm 3 has ©(logn) regret. 

Proof: Substituting (19) and (16) in (15). □ 

Hence, we prove that distributed learning and channel 
access among multiple secondary users is possible with log¬ 
arithmic regret without any explicit communication among 
the users. This implies that the number of lost opportunities 
for successful transmissions at all secondary users is only 
logarithmic in the number of transmissions, which is negligible 
when there are a large number of transmissions. 

We have so far focused on designing schemes that maximize 
system or social throughput. We now briefly discuss the 
fairness for an individual user under p RAND . Since p RAND does 
not distinguish any of the users, in the sense that each user 
has equal probability of “settling" down in one of the /7-best 
channels while experiencing only logarithmic regret in doing 
so. Simulations in Section VII (in Fig. 4) demonstrate this 
phenomenon. 

We also note that the bound on regret under p RAND grows 
rapidly with the number of users U, due to the bound in 
(17). This is due to uniform channel access by the users 
without any coordination or information exchange. It is of 
interest to explore better distributed access schemes, which 
when combined with learning algorithms yield low regret in 
the number of users. 


Algorithm 4 Policy p EST (n, (7, gj (rn), £) for each user j under 
n transmission slots (horizon length), C channels, statistic 
g j(m) and threshold function £. 

1) Input: • Sample-mean availabilities 

at user j , gj(i\n)\ statistic based on Xij(n ), 

cr(a; gj(n)): index of a th highest entry in g j(n). 

n)\ indicator of collision at n th slot at channel i 
U : current estimate of the number of users. 
n: horizon (total number of slots for transmission) 

£(n; k): threshold functions for k = 1,..., C. 

2) Init: Sense each channel once, m (7, Curr_Sel <— 
(7, Curr_Rank <— 1, /7 <— 1, Q(i;m) 0 for all 
i = 1,..., C 

3) Loop 4 to 6: m <— m +1, stop when m = n or U = C. 

4) If (j{Curr_Sel ; m — 1) = 1 then 

Draw a new Curr_Rank Unif(Z7). end if 
Select channel for sensing. If free, transmit. 

Curr_Sel <— a(Curr_Rank; gj(rn)) 

5) (j(Curr_Sel;m ) <— 1 if collision, 0 o.w. 

6 ) if YZ= i £fc=i GM*;; sjH);«) > £(«; u)) then 

U <— U + 1, Q(i;a) 0, i = 1,... (7, a = 1,..., m. 

end if 


V. Distributed Learning and Access under 
Unknown Number of Users 

We have so far assumed that the number of secondary 
users is known, and is required for the implementation of 
the p RAND policy. In practice, this entails initial announcement 
from each of the secondary users to indicate their presence in 
the cognitive network. However, in a truly distributed setting 
without any information exchange among the users, such an 
announcement may not be possible. 

In this section, we consider the scenario, where the number 
of users U is unknown (but fixed throughout the duration of 
transmissions and U < (7, the number of channels). In this 
case, the policy needs to estimate the number of secondary 
users in the system, in addition to learning the channel 
availability statistics and designing channel access rules based 
on collision feedback. Note that if the policy assumed the 
worst-case scenario that U = (7, then the regret grows linearly 
since there is a positive probability that the U -worst channels 
are selected for sensing in any time slot. 


A. Description of p EST Policy 

We now propose a policy p EST in Algorithm 4. This policy 
incorporates two broad aspects in each transmission slot, viz., 
execution of the p RAND policy in Algorithm 3, based on the 
current estimate of the number of users Z7, and updating of 
the estimate U based on the number of collisions experienced 
by the user. 

The updating is based on the idea that if there is under¬ 
estimation of U at all the users ( Uj < U at all the users j ), 
collisions necessarily build up and the collision count serves 
as a criterion for incrementing U. This is because after a long 
learning period, the users learn the true ranks of the channels, 
and target the same set of channels. However, when there is 
















under-estimation, the number of users exceeds the number of 
channels targeted by the users. Hence, collisions among the 
users accumulate, and can be used as a test for incrementing 
U. 

Denote the collision count used by the p EST policy as 

m k 

$k,j(m) := ££CjH&;gjM);a)- (20) 

< 2=1 6=1 

which is the total number of collisions experienced by user j 
so far (till the m th transmission slot) in the top Uj -channels, 
where the ranks of the channels are estimated using the g- 
statistics. The collision count is tested against a threshold 
£(n;Uj), which is a function of the horizon length 7 and 
current estimate Uj. When the threshold is exceeded, Uj is 
incremented, and the collision samples collected so far are 
discarded (by setting them to zero) (line 6 in Algorithm 4). 
The choice of the threshold for incrementing Uj is critical; if 
it is too small it can result in over-estimation of the number 
of users. On the other hand, if it is too large, it can result in 
slow learning and large regret. Threshold selection is studied 
in detail in the subsequent section. 

B. Regret Bounds under p EST 

We analyze regret bounds under the p EST policy, where the 
regret is defined in (3). Let the maximum threshold function 
for the number of consecutive collisions under the p EST policy 
be denoted by 

£*(n;/7):= max £(n;fc). (21) 

We prove that the p EST policy has 0(£*(n;U)) regret when 
£*(n; U) = uj (log n), and where n is the number of transmis¬ 
sion slots. 

The proof for the regret bound under the p EST policy consists 
of two main parts: we prove bounds on regret conditioned 
on the event that none of the users over-estimate U. Second, 
we show that the probability of over-estimation at any of the 
users goes to zero asymptotically in the horizon length. Taken 
together, we obtain the regret bound for the p EST policy. 

Note that in order to have small regret, it is crucial that none 
of the users over-estimate U. This is because when there is 
over-estimation, there is a finite probability of selecting the 
(7-worst channels even upon learning the true ranks of the 
channels. Note that regret is incurred whenever a U-worst 
channel is selected since under perfect knowledge this channel 
would not be selected. Hence, under over-estimation, the regret 
grows linearly in the number of transmissions. 

In a nutshell, under the p EST policy, the decision to increment 
the estimate U reduces to a hypothesis-testing problem with 
hypotheses Ho'- number of users is less than or equal to the 
current estimate and Hi’, number of users is greater than 
the current estimate. In order to have a sub-linear regret, 
the false-alarm probability (deciding Hi under Ho) needs to 
decay asymptotically. This is ensured by selecting appropriate 

7 In this section, we assume that the users are aware of the horizon length 
n for transmission. Note that this is not a limitation and can be extended 
to case of unknown horizon length as follows: implement the algorithm by 
fixing horizon lengths to no, 2no, 4no ... for a fixed no E N and discarding 
estimates from previous stages. 


thresholds £(n) to test against the collision counts obtained 
through feedback. 

Conditional Regret: We now give the result for the first 
part. Define the “good event” C(n; U) that none of the users 
over-estimates U under p EST as 

u 

C(n;U):={f]U] ST (n)<U}. (22) 

3 = 1 


The regret conditioned on C(n;U ), denoted by 

R(n ; /x, U, p EST )\C(n; U ), is given by 

u c u 

n'£Kk*)-J2'E^mVi, j (n)\C(n;U)}, 

k =1 2=1 j —1 


where Vij(n) is the number of times that user j is the sole 
user of channel i. Similarly, we have conditional expectations 
of K[Tij(n)\C(n; U)] and of the number of collisions in U- 
best channels, given by E[M(n)|C(n; U)]. We now show that 
the regret conditioned on C(n; U) is 0(max(£*(n; /7),logn)). 

Lemma 4: (Conditional Regret): When all the U secondary 
users implement the p EST policy, we have for alii E U -worst 
channel and each user j = 


(n) | C(n) 


u 

k=l 


8 log n 7 r 2 

---b 1 H- 

A (i,/c*) 2 3 


(23) 


The conditional expectation on number of collisions M(n) in 
the U -best channel satisfies 


u 

E[M(n)\C(n; U)]<U^2 £(n; k) < U 2 C(n; U). (24) 

k =1 

From (15), we have R(n)\C(n; U) is 0(max(£*(n; (7), logn)) 
for any n E N. 

Proof: See Appendix F. □ 

Probability of Over-estimation: We now prove that none 
of the users over-estimates 8 U under the p EST policy, i.e., the 
probability of the event C(n\U) in (22) approaches one as 
n —> oo, when the thresholds £( n\U ) for testing against 
the collision count are chosen appropriately (see line 6 in 
Algorithm 4). Trivially, we can set £(n; 1) = 1 since a single 
collision is enough to indicate that there is more than one user. 
For any other k > 1, we choose function £ satisfying 

£(n; k) = c<;(log n), Vfc > 1. (25) 


We prove that the above condition ensures that over-estimation 
does not occur. 

Recall that T'(n; p EST ) is the number of slots where any 
one of the top /7-estimated ranks of the channels at some 


is 


user is wrong under the p EST policy. We show that E [T'(n) 
0(log n). 

Lemma 5 (Time spent with wrong estimates): The 
expected number of slots where any of the top /7-estimated 


8 Note that the p EST policy automatically ensures that all the users do not 
under-estimate U, since it increments U based on collision estimate. This 
implies that the probability of the event that all the users under-estimate U 
goes to zero asymptotically. 
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ranks of the channels at any user is wrong under the p EST 
policy satisfies 


u 


c 


E [T'(n)]<uJ2 E 


a=l6=a+1 


8 logn 
A (a*,6*) 2 


+ 1 + 


ir 


(26) 


Proof: The proof is on the lines of Lemma 3 □ 

Recall the definition of T(£7, U) in the previous section, 
as the maximum time to absorption starting from any initial 
distribution of the finite-state Markov chain, where the states 
correspond to different user configurations and the absorbing 
state corresponds to the collision-free configuration. We now 
generalize the definition to T(£7, fc), as the time to absorption 
in a new Markov chain, where the state space is the set of 
configurations of U users in k channels, and the transition 
probabilities are defined on similar lines. Note that T(C7, k) 
is almost-surely finite when k > U and oo otherwise (since 
there is no absorbing state in the latter case). 

We now bound the maximum value of the collision count 
3>fcj(ra) under the p EST policy in (20) using T'(m), the total 
time spent with wrong channel estimates, and T(£7, fc), the 

s t 

time to absorption in the Markov chain. Let < denote the 
stochastic order for two random variables [25]. 

Proposition 2: The maximum collision count in (20) over 
all users under the p EST policy satisfies 


max + 1)T(C7, fc), Vm E N. (27) 

j=l,...,U 

Proof: The proof is on the lines of Theorem 3. See Ap¬ 
pendix G. □ 

We now prove that the probability of over-estimation goes 
to zero asymptotically. 

Lemma 6 (No Over-estimation Under p EST ): For threshold 
function £ satisfying (25), the event C(n; U ) in (22) satisfies 


lim F[C(n;U)\ = 1, (28) 

oo 


and hence, none of the users over-estimates U under the p EST 
policy. 

Proof: See Appendix H. □ 

We now give the main result of this section that p EST has 
slightly more than logarithmic regret asymptotically and this 
depends on the threshold function £*(n; U) in (21). 

Theorem 5 (Asymptotic Regret Under p EST ): With 
threshold function £ satisfying conditions in (25), the 
policy p EST (n, C, gj (ra), £) in Algorithm 4 satisfies 


lim sup 

n—^ oo 


R(n ; n, U, p^) 
£*0; U) 


< OO. 


(29) 


Proof: From Lemma 4 and Lemma 6. □ 

Hence, the regret under the proposed p EST policy is 
0(£*(n;U)) under fully decentralized setting without the 
knowledge of number of users when £*(n;U) = o;(logn). 
Hence, 0(f(n)\ogn) regret is achievable for all functions 
f(n) —> oo as n —> oo. The question of whether logarithmic 
regret is possible under unknown number of users is of 
interest. 

Note the difference between the p EST policy in Algorithm 4 
under unknown number of users with the p RAND policy with 
known number of users in Algorithm 3. The regret under p EST 


is 0(/(n)logn) for any function f(n) = cj( 1), while it is 
0(log n) under the p RAND policy. Hence, we are able to quantify 
the degradation of performance when the number of users is 
unknown. 


VI. Lower Bound & Effect of Number of Users 


A. Lower Bound For Distributed Learning & access 

We have so far designed distributed learning and access 
policies with provable bounds on regret. We now discuss the 
relative performance of these policies, compared to the optimal 
learning and access policies. This is accomplished by noting 
a lower bound on regret for any uniformly-good policy, first 
derived in [4] for a general class of uniformly-good time- 
division policies. We restate the result below. 

Theorem 6 (Lower Bound [4]): For any uniformly good 
distributed learning and access policy p, the sum regret in 
(2) satisfies 


lim inf > V V KT^-v 

?woo logn ' Dip*. Un*) 

^ iEU-worst j=l ) 


(30) 


The lower bound derived in [9] for centralized learning and 
access holds for distributed learning and access considered 
here. But a better lower bound is obtained above by consid¬ 
ering the distributed nature of learning. The lower bound for 
distributed policies is worse than the bound for the centralized 
policies in (11). This is because each user independently learns 
the channel availabilities p in a distributed policy, whereas 
sensing decisions from all the users are used for learning in a 
centralized policy. 

Our distributed learning and access policy p RAND matches 
the lower bound on regret in (15) in the order (logn) but the 
scaling factors are different. It is not clear if the regret lower 
bound in (30) can be achieved by any policy under no explicit 
information exchange and is a topic for future investigation. 


B. Behavior with Number of Users 

We have so far analyzed the sum regret under our policies 
under a fixed number of users U. We now analyze the behavior 
of regret growth as U increases while keeping the number of 
channels C > U fixed. 

Theorem 7 (Varying Number of Users): When the number 
of channels C is fixed and the number of users U < C is 
varied, the sum regret under centralized learning and access 
p CENT in (12) decreases as U increases while the upper bounds 
on the sum regret under p RAND in (15) monotonically increases 
with U. 

Proof: The proof involves analysis of (12) and (15). To 

prove that the sum regret under centralized learning and access 
in (12) decreases with the number of users U, it suffices to 
show that for i E U -worst channel, 

A (FV) 

D(p,i, pu*) 

decreases as U increases. Note that p(U*) and D(pi,pu*) 
decrease as U increases. Hence, it suffices to show that 

D(pi , pu *) 
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decreases with U. This is true since its derivative with respect 
to U is negative. 

For the upper bound on regret under p RAND in (15), when U 
is increased, the number of /7-worst channels decreases and 
hence, the first term in (15) decreases. However, the second 
term consisting of collisions M(n) increases to a far greater 
extent. □ 

Note that the above results are for the upper bound on regret 
under the p RAND policy and not the regret itself. Simulations in 
Section VII reveal that the actual regret also increases with U. 
Under the centralized scheme p CENT , as U increases, the number 
of /7-worst channels decreases. Hence, the regret decreases, 
since there are less number of possibilities of making bad 
decisions. However, for distributed schemes although this 
effect exists, it is far outweighed by the increase in regret 
due to the increase in collisions among the U users. 

In contrast, the distributed lower bound in (30) displays 
anomalous behavior with U since it fails to account for 
collisions among the users. Here, as U increases there are 
two competing effects: a decrease in regret due to decrease 
in the number of Z7-worst channels and an increase in regret 
due to increase in the number of users visiting these U -worst 
channels. 

VII. Numerical Results 

We present simulations for the algorithms developed in 
the paper, varying the number of users and channels to 
verify the performance of the algorithms detailed earlier. We 
consider C= 9 channels (or a subset of them when the num¬ 
ber of channels is varying) with probabilities of availability 
characterized by Bernoulli distributions with evenly spaced 
parameters ranging from 0.1 to 0.9. 

Comparison of Different Schemes: Fig. 2a compares the 
regret under the centralized and random allocation schemes in 
a scenario with U = 4 cognitive users vying for access to the 
(7 = 9 channels. The theoretical lower bound for the regret 
in the centralized case from Theorem 2 and the distributed 
case from Theorem 6 are also plotted. The upper bounds on 
the random allocation scheme from Theorem 4 is not plotted 
here, since the bounds are loose especially as the number of 
users U increases. Finding tight upper bounds is a subject of 
future study. 

As expected, centralized allocation has the least regret. 
Another important observation is the gap between the lower 
bounds on the regret and the actual regret in both the dis¬ 
tributed and the centralized cases. In the centralized scenario, 
this is simply due to using the g MEAN statistic in (34) instead 
of the optimal g OPT statistic in (5). However, in the distributed 
case, there is an additional gap since we do not account for 
collisions among the users. Hence, the schemes under con¬ 
sideration are O(logn) and achieve order optimality although 
they are not optimal in the scaling constant. 

Performance with Varying U and C: Fig. 3a explores the 
impact of increasing the number of secondary users U on the 
regret experienced by the different policies with the number 
of channels C fixed. With increasing Z7, the regret decreases 
for the centralized schemes and increases for the distributed 
schemes, as predicted in Theorem 7. The monotonic increase 



x 10 4 


(a) Normalized regret vs. n slots. 

U = 4 users, C = 9 channels. 



(b) Normalized regret vs. n slots. 

U — 4 users, C — 9 channels. 



(c) No. of collisions M(n) vs. n slots. 

U = 4 users, C — 9 channels, p RAND 
policy. 

Fig. 2. Simulation Results. Probability of Availability /x = 
[0.1, 0.2,.. . ,0.9]. 

of regret under random allocation p RAND is a result of the 
increase in the collisions as U increases. On the other hand, 
the monotonic decreasing behavior in the centralized case is 
because as the number of users increases, the number of U- 
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(a) Normalized regret vs. U users. 

C — 9 channels, n — 2500 slots. 



(b) Normalized regret vs. C channels. 
U — 2 users, n = 2500 slots. 



(c) Normalized regret vs. U users. 

User-channel ratio U/C = 0.5, n = 2500 slots. 

Fig. 3. Simulation Results. Probability of Availability /x 
[0.1, 0.2,... ,0.9]. 


decrease in number of U -worst channels and increase in regret 
due to increase in number of users visiting these /7-worst 
channels. 

Fig. 3b evaluates the performance of the different algorithms 
as the number of channels C is varied while fixing the number 
of users U. The probability of availability of each additional 
channel is set higher than those already present. Here, the 
regret monotonically increases with C in all cases. When the 
number of channels increases along with the quality of the 
channels, the regret increases as a result of an increase in the 
number of U -worst channels as well as the increasing gap in 
quality between the U -best and U -worst channels. 

Also, the situation where the ratio U/C is fixed to be 0.5 
and both the number of users and channels along with their 
quality increase is considered in Fig. 3c and we find that 
the regret grows in this case as well. Once again, this is in 
agreement with theory since the number of U -worst channels 
increases as U and C increase while keeping U/C fixed. 

Collisions and Learning: Fig. 2c verifies the logarithmic 
nature of the number of collisions under the random alloca¬ 
tion scheme p RAND . Additionally, we also plot the number of 
collisions under p RAND in the ideal scenario when the channel 
availability statistics (x are known to see the effect of learning 
on the number of collisions. The low value of the number 
of collisions obtained under known channel parameters in 
the simulations is in agreement with theoretical predictions, 
analyzed as Z7E[Y(Z7, U)] in Lemma 2. As the number of 
slots n increases, the gap between the number of collisions 
under the known and unknown parameters increases since the 
former converges to a finite constant while the latter grows as 
O(logn). The logarithmic behavior of the cumulative number 
of collisions can be inferred from Fig. 2a. However, the curve 
in Fig. 2c for the unknown parameter case appears linear in 
n due to the small value of n. 

Difference between g OPT and g MEAN : Since the statistic g MEAN 
used in the schemes in this paper differs from the optimal 
statistic g OPT in (5), a simulation is done to compare the perfor¬ 
mance of the schemes under both the statistics. As expected, in 
Fig. 2b, the optimal scheme has better performance. However, 
the use of g MEAN enables us to provide finite-time bounds, as 
described earlier. 

Fairness: One of the important features of p RAND is that 
it does not favor any one user over another. Each user has 
an equal chance of settling down in any one of the /7-best 
channels. Fig. 4 evaluates the fairness characteristics of p RAND . 
The simulation assumes U = 4 cognitive users vying for 
access to C = 9 channels. The graph depicts which user 
asymptotically gets the best channel over 1000 runs of the 
random allocation scheme. As can be seen, each user has 
approximately the same frequency of being allotted the best 
channel indicating that the random allocation scheme is indeed 
fair. 


worst channels decreases resulting in lower regret. Also, the 
lower bound for the distributed case in (30) initially increases 
and then decreases with U. This is because as U increases 
there are two competing effects: decrease in regret due to 


VIII. Conclusion 

In this paper, we proposed novel policies for distributed 
learning of channel availability statistics and channel access 
of multiple secondary users in a cognitive network. The first 
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Fig. 4. Simulation Results. Probability of Availability /x = 
[0.1, 0.2,... , 0.9]. No. of slots where user has best channel vs. user. U = 4, 
C = 9, n = 2500 slots, 1000 runs, p RAND . 


policy assumed that the number of secondary users in the 
network is known, while the second policy removed this 
requirement. We provide provable guarantees for our policies 
in terms of sum regret. By noting the lower bound on regret for 
any uniformly-good learning and access policy, we find that 
our first policy achieves order-optimal regret while our second 
policy is also nearly order optimal. Our analysis in this paper 
provides insights on incorporating learning and distributed 
medium access control in a practical cognitive network. 

The results of this paper open up an interesting array of 
problems for future investigation. Simulations suggest that our 
lower and upper bounds are not tight in terms of the scaling 
constant and that better bounds are needed. Our assumptions 
of an i.i.d. model for primary user transmissions and perfect 
sensing at the secondary users need to be relaxed. Our policy 
allows for an unknown but fixed number of secondary users, 
and it is of interest to incorporate users dynamically entering 
and leaving the system. Moreover, our model ignores dynamic 
traffic at the secondary nodes and extension to a queueing- 
theoretic formulation is desirable. We consider the worst-case 
scenario that there is no information exchange among the sec¬ 
ondary users. Extension to the case with limited information 
exchange is of interest. The proposed schemes are designed 
to be simple and tractable for mathematical analysis. It is 
desirable to obtain more robust schemes with additional design 
considerations for real-world deployment. 
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Appendix 


A. Proof of Theorem 2 

The result in (13) involves extending the results of [11, 
Thm. 1]. Define T{ (n):= X;=i Ti, j(n ) as the number of times 
a channel i is sensed in n rounds for all users. We will show 
that 


m(n)]< £ 

k£U -best 

We have 


8 logn 
A (fc*, i) 2 


+ 1 + 


7T 


\/i E /7-worst. 


(31) 


P[Tx. in i in n th slot] = P[g(/7*; n ) < g(i; n)], 
=F[A(i; n ) n ( g(U *; n) < g(i ; n)) 

+ P[7L c (i; n) fl (, g(U *; n) < g(i ; n))], 


where 

A(i;n):= (J (g(k;n) < g{i;n)) 

/cEt/-best 

is the event that at least one of the /7-best channels has g- 
statistic less than i. Hence, from union bound we have 

P[y4(i;n)]< P [g(k\n) < g{i\ri) . 

k£U -best 

We have for C > U, 

¥[A c (i; n ) D (g(U*; n ) < g(i; n))] = 0, 


Hence, 


P[Tx. in i in n th round] < E P[g(k;n) < g(i\ n)}. 

kE- best 

On the lines of [11, Thm. 1], we have Vfc,z 
k is U -best, i is U -worst 


n 


< g{v,l)} < —— 
1=1 v ’ 


8 log n 7T 


+ 1 + 


) 2 ' " ' 3 


Hence, we have (31). Lor the bound on regret, we can break 
R in (2) into two terms 


R(n; fi,U,p CEm ) = lu E A ( r ’7 E[Ti(n ) 

iGl/-worst ^=1 


1 


U 


1 


u 


+ E [^E A ( r >7 E[Ti(n)] 

i£U- best £=1 


For the second term, we have 


U 

iClU- best 1=1 


1 


U 


E m(n)\ 


<E[T*(n)] £ [-E A M 

i£U- best ^=1 


1 


U 


= 0, 


where T*(n):= max TAn). Hence, we have the bound. □ 

ieU- best 
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B. Proof of Proposition 1 

For convenience, let Ti(n) := Vi(n) := 

JE =1 L^j(n). Note that JE =1 Ti(n) = nU , since each user 
selects one channel for sensing in each slot and there are U 
users. From (3), 


u 


c 


R(n ) =nJ2^(,n - [V(n) 


2=1 


2=1 


< E Mi)(«-E[Vi(n)]) 

iElZ-best 

</z(l*)(n(7- ^ E[V-(n)]) 

iElZ-best 

=/z(l*)(E[M(n)] + ^ E[Tj(n)]) 

iEU- worst 


(32) 


(33) 


where Eqn.(32) uses the fact that Vi(n) < n since total number 
of sole occupancies in n slots of channel i is at most n, and 
Eqn.(33) uses the fact that M(n ) = X^et/-best(^( n ) — Vi(n)). 

For the lower bound, since each user selects one channel 
for sensing in each slot, Yli=i Sj=i 21 j(n) = nU. Now 
Tij(n) E Vij(n). 


1 

R{n\l*iU,p) > — 


u u c 



A (Ufi)E[T id (n) 


k —1 j —1 i=l 


U 


>E E AtfT^E^n) 

j=l iElZ-worst 


□ 


C. Proof of Lemma 2 

Although, we could directly compute the time to absorption 
of the Markov chain, we give a simple bound E[T(C7, U)\ by 
considering an i.i.d process over the same state space. We term 
this process as a genie-aided modification of random allocation 
scheme, since this can be realized as follows: in each slot, a 
genie checks if any collision occurred, in which case, a new 
random variable is drawn from Unif (U) by all users. This is 
in contrast to the original random allocation scheme where a 
new random variable is drawn only when the particular user 
experiences a collision. Note that for U = 2 users, the two 
scenarios coincide. 

For the genie-aided scheme, the expected number of slots to 
hit orthogonality is just the mean of the geometric distribution 


E fc ( i -p) k p 

k=l 


1 — p 

V 


< 00 , 


(34) 


where p is the probability of having an orthogonal configura¬ 
tion in a slot. This is in fact the reciprocal of the number of 
compositions of U [24, Thm. 5.1], given by 



cvr 


(35) 


The above expression is nothing but the reciprocal of number 
of ways U identical balls (users) can be placed in U different 
bins (channels): there are 2/7 — 1 possible positions to form 
U partitions of the balls. 


Now for the random allocation scheme without the genie, 
any user not experiencing collision does not draw a new 
variable from Unif(Z7). Hence, the number of possible config¬ 
urations in any slot is lower than under genie-aided scheme. 
Since there is only one configuration satisfying orthogonality 9 , 
the probability of orthogonality increases in the absence of the 
genie and is at least (35). Hence, the number of slots to reach 
orthogonality without the genie is at most (34). Since in any 
slot, at most U collisions occur, (17) holds. □ 


D. Proof of Lemma 3 


Let c 


_. / 2 log n 


n ,m ■ 


m 


Case 1: Consider U = C = 2 first. Let 

A(t, /):={g^ EAN (l*; t — 1) < ^ MEAN (2*; t - 1), T'{t - 1) > /}. 

On lines of [11, Thm. 1], 


72 


T» < J + £ 44(2,0], 


t=2 

oo t 


E ^ T E E 1 (^X\* j(fl) T E X 2 * : j {jYl) T Ct,m) 

t =1 m-\-h—l 

The above event is implied by 

X\ * ,j{fi) + Ct,h — X 2 *j(h) T )-m 

since 

The above event implies at least one of the following events 
and hence, we can use the union bound. 

Xi*,j{h) E P'1* 

X 2 *,j(jtt) E p2 * T 0,/l + r?2 5 

Pi* <C p2 * 2ct,h+m- 

From the Chernoff-Hoeffding bound, 

P[Xi *j(t) < pi* - ct th \ < f -4 , 

E p2 * T Ct,h-\-m\ E t 1 

and the event that pi* < P 2 * + 2 c t ,h+m implies that 

8 log t 


h + m < 


Since 


00 



A 2 

^1* 2* 


2t~ 4 = 


7T 


t—1 722=1 h=l 


3 ’ 


EP>; U = C = 2)} <— 2 


8 log n . 7T 


+ 1 + 


1 * ,2 = 


Case 2: For min(Z7, C ) > 2, we have 

U C n 

T'(n)<u ^2 E E / ( 5 “ ean («*;H< 5 “ ean (&*;H), 

a— 1 6 =a+l 722=1 

where a* and 6* represent channels with a th and 6 th highest 
availabilities. On lines of the result for U = C = 2, we can 
show that 


72 


E/[<?f AN (a*;m) < gf m {b*-m) 


< 


8 log n . 7T 


722=1 

Hence, (18) holds. 

9 since all users are identical for this analysis. 


A 2 

a* ,6* 


+ 1 + 


□ 
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E. Proof of Theorem 3 


H. Proof of Lemma 6 


Define the good event as all users having correct top ZE- 
order of the ^-statistics, given by 


u 

S(n):= (^{Top-ZE entries of g j(n) are same as in p,}. 

3 = 1 

The number of slots under the bad event is 


E 7 [s c M* 



by definition of T'{n). In each slot, either a good or a bad 
event occurs. Let 7 be the total number of collisions in ZE- 
best channels between two bad events, i.e., under a run of 
good events. In this case, all the users have the correct top 
ZE-ranks of channels and hence, 


E[ 7 |S(n)] < ZEE[Y(ZE, ZE)] < oc : 


where E[Y(ZE, U)\ is given by (17). Hence, each transition 
from the bad to the good state results in at most ZEE[Y(ZE, U)\ 
expected number of collisions in the ZE-best channels. The 
expected number of collisions under the bad event is at most 
ZEE [T'(n)]. Hence, (19) holds. □ 


F. Proof of Lemma 4 

Under C(n; ZE), a ZE-worst channel is sensed only if it is 
mistaken to be a ZE-best channel. Hence, on lines of Lemma 1, 


We are interested in 


P[C c (n); U] = P[uf = 1 £/f» > U], 

n U 

= P[|J U {$uAm) > 

m —1 j =1 

= P[ max > £(n;t/)], 

where <f> is given by (20). For ZE = 1, we have P[C c (n); ZE] = 0 
since no collisions occur. 

Using (27) in Proposition 2, 


k 


P[max (n) > £(n; k)] 

3 = 1 

< F[kT(U, k)(T'(n) + 1) > £(n; k)} 

< P [k{T\n) + 1) > iE-E] + p[T(C7, k) > a 


a 


n 


< 


ka n (E[T'(n)\ + 1) 

€(n;k) 


+ P[T(ZE, k) > a n 


(36) 


using Markov inequality. By choosing a n = o;(l), the second 
term in (36), viz., P[Y(ZE, k) > a n ] — > 0 as n —» oo, for 
k > ZE. For the first term, from (26) in Lemma 5, E[T'(n) = 
O(logn). Hence, by choosing a n = o(£*(n; k)/ logn), the 
first term decays to zero. Since £*(n; ZE) = a; (logn), we can 
choose a n satisfying both the conditions. By letting k — U in 
(36), we have P[C c (n); ZE] 0 as n — > oo, and (28) holds. □ 
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Abstract —Managing trust in a distributed Mobile Ad Hoc Net¬ 
work (MANET) is challenging when collaboration or cooperation 
is critical to achieving mission and system goals such as reliability, 
availability, scalability, and reconfigurability. In defining and 
managing trust in a military MANET, we must consider the 
interactions between the composite cognitive, social, information 
and communication networks, and take into account the severe 
resource constraints (e.g., computing power, energy, bandwidth, 
time), and dynamics (e.g., topology changes, node mobility, node 
failure, propagation channel conditions). We seek to combine 
the notions of “social trust” derived from social networks with 
“quality-of-service (QoS) trust” derived from information and 
communication networks to obtain a composite trust metric. We 
discuss the concepts and properties of trust and derive some 
unique characteristics of trust in MANETs, drawing upon social 
notions of trust. We provide a survey of trust management 
schemes developed for MANETs and discuss generally accepted 
classifications, potential attacks, performance metrics, and trust 
metrics in MANETs. Finally, we discuss future research areas 
on trust management in MANETs based on the concept of social 
and cognitive networks. 

Index Terms—Trust management, mobile ad hoc networks, 
social networks, cognitive networks, trust, trust metrics. 

I. Introduction 

N AN INCREASINGLY networked world, increased con¬ 
nectivity could lead to improved information sharing, fa¬ 
cilitate collaboration, and enable distributed decision making, 
which is the underlying concept in Network Centric Opera¬ 
tions. In mobile ad hoc networks (MANETs), the distributed 
decision making should take into account trust in the elements: 
the sources of information, the processors of information, the 
elements of the communications network across which the 
information is transmitted, etc. This trust must often be derived 
under time-critical conditions, and in a distributed way. 

A. Design Challenges in MANET Protocols 

A mobile ad hoc network [1] consists of wireless mobile 
nodes forming a temporary network without the help of cen¬ 
tralized infrastructure, and where nodes communicate through 
multi-hops. 

Security protocol designers for MANETs face technical 
challenges due to severe resource constraints in bandwidth, 
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memory size, battery life, computational power, and unique 
wireless characteristics such as openness to eavesdropping, 
lack of specific ingress and exit points, high security threats, 
vulnerability, unreliable communication, and rapid changes 
in topologies or memberships because of user mobility or 
node failure [1][2][3]. In addition, compared with designing 
security protocols for civilian MANETs, designing security 
protocols for military MANETs requires additional caution, 
since battlefield communication networks must cope with 
hostile environments, node heterogeneity, often stringent per¬ 
formance constraints, node subversion, high tempo operations 
leading to rapid changes in network topology and service 
requirements, and dynamically formed communities of in¬ 
terest wherein participants may not have predefined trust 
relationships [4]. To cope with these dynamics, networks 
must be able to reconfigure seamlessly, via low-complexity 
distributed network management schemes [3]. Security in a 
tactical network includes notions of communication security 
which can be easily quantified as opposed to the perception 
of security which is hard to quantify. 

B. Motivation for Trust Management in MANETs 

The concept of ’’Trust” originally derives from social sci¬ 
ences and is defined as the degree of subjective belief about 
the behaviors of a particular entity [5]. Blaze et al. [6] first 
introduced the term ’’Trust Management” and identified it as 
a separate component of security services in networks and 
clarified that ’’Trust management provides a unified approach 
for specifying and interpreting security policies, credentials, 
and relationships.” 

Trust management in MANETs is needed when partic¬ 
ipating nodes, without any previous interactions, desire to 
establish a network with an acceptable level of trust rela¬ 
tionships among themselves. Examples would be in building 
initial trust bootstrapping [7], coalition operations without 
predefined trust, and authentication of certificates generated 
by another party when links are down or ensuring safety 
before entering a new zone [8]. In addition, trust management 
has diverse applicability in many decision making situations 
including intrusion detection, authentication, access control, 
key management, isolating misbehaving nodes for effective 
routing, and other purposes. 

Trust management, including trust establishment, trust up¬ 
date, and trust revocation, in MANETs is also much more 
challenging than in traditional centralized environments. For 
example, collecting trust information or evidence to evalu¬ 
ate trustworthiness is difficult due to changes in topology 
induced by node mobility or node failure. Further, resource 
constraints often confine the trust evaluation process only 
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Fig. 1. The multidisciplinary concept of trust and its application in communications and networking. 


to local information. The dynamic nature and characteristics 
of MANETs result in uncertainty and incompleteness of the 
trust evidence, which is continuously changing over time [8] 
[9]. Despite a couple of surveys of trust management [10] 
[11] [12], a comprehensive survey of trust management in 
MANETs does not exist and is the main aim of this paper. 
A short version of this paper was presented at ICCRTS 2009 
[13]. The contributions of this paper are: (1) to give a clear 
definition of trust in the communication and networking field, 
drawing upon definitions from different disciplines; (2) to 
extensively survey the existing trust management schemes 
developed for MANETs and investigate their general trends; 
and (3) to discuss future research areas based on the concept 
of social and cognitive networks. 

The rest of this paper is organized as follows. In Section 2, 
we discuss the concept of trust in diverse disciplines, give a 
clear distinction between trust and trustworthiness, and discuss 
the relationship between trust and risk. We also introduce 
the main properties of trust in MANETs. Section 3 surveys 
generally accepted classifications of trust management, at¬ 
tacks considered in existing trust management schemes for 
MANETs, and metrics used to measure the performance 
of existing MANET trust management schemes. Section 4 
surveys trust management schemes that have been developed 
for specific purposes, including secure routing, authentication, 
intrusion detection, access control, key management, and trust 
evidence distribution and evaluation. In Section 5, we discuss 
design concepts that designers of MANET trust management 
systems should keep in mind and suggest trust metrics based 
on the concepts of social trust and quality-of-service (QoS) 
trust. Section 6 concludes this paper. 

II. Concepts and Properties of Trust 

In this section, we review how trust is defined in different 
disciplines and how these trust concepts can be applied in 
modeling trust in MANETs. Further, we examine the relation¬ 
ship between trust and risk, and how trust should be defined 
in order to realistically reflect the unique characteristics of 
MANETs. 

A. Multidisciplinary Concept of Trust 

According to Merriam Webster's Dictionary [14], trust is 
defined as ’’assured reliance on the character, ability, strength, 


or truth of someone or something.” Despite the subjective 
nature of trust, the concept of trust has been very attractive 
to network security protocol designers because of its diverse 
applicability as a decision making mechanism. We examine 
the literature to study how trust is defined in various disci¬ 
plines including sociology, economics, philosophy, psychol¬ 
ogy, organizational management, and autonomic computing 
in industrial and system engineering. Finally, we also examine 
how trust can be defined in communications and networking 
with the help of definitions in other fields. 

Trust in sociology : Gambetta’s notion of trust [15] is popu¬ 
larly called sociological trust and is defined as an assessor’s a 
priori subjective probability that a person (or agent, or group) 
will perform specific actions that affect the assessor. That is, 
Gambetta [15] describes the nature of trust as subjectivity, 
an indicator for future actions, and dynamicity based on 
continuous interactions between two entities. Luhmann [16] 
also emphasized the importance of trust in society as a 
mechanism for building cooperation among people to extend 
human interactions for future collaboration. Adams et al. [17] 
rephrased Gambetta’s trust concept in applying the sociolog¬ 
ical concept of trust in computer science; they represented 
trust as a continuous variable, quantifying trust in the light 
of context or acceptance of risk. They further stressed that 
risking betrayal is an important aspect in building trust. To 
be useful, network trust models must capture this subjective 
aspect of social trust. 

Trust in economics : Economists distinguish between the 
personal, informal trust that comes from being friendly with 
your neighbors and the impersonal, institutionalized trust that 
lets you give your credit card number out over the Internet 
[18]. Both notions of trust are important in military MANETs. 
In economics, trust is represented as an expectation that 
applies to situations in which trustors take risky actions under 
uncertainty or information incompleteness [19]. However, as 
illustrated in the Prisoner's Dilemma (PD) game [20], trust in 
economics is based on the assumption that humans are rational 
and strict utility maximizers of their own interest or incentives. 
In this sense, when we apply a human trust model to a network 
trust model, the assumption of selfish nodes seems reasonable. 
But altruistic behaviors can emerge from mechanisms that may 
be initially purely selfish [21], and thus making an argument 
for redemption mechanisms. Economic models are used in 
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conjunction with trust-based encryption primitives in [22] to 
develop a trust management paradigm for securing information 
flows across organizations. 

Trust in philosophy : According to the Stanford Encyclope¬ 
dia of Philosophy [23], trust is important but dangerous. Since 
trust allows us to form relationships with others and to rely on 
others for love, advice, help, etc., trust is regarded as a very 
important factor in our life that compels others to give us such 
things with no outside force such as the law. On the other 
hand, since trust requires taking a risk that the trustee may 
not behave as the trustor expects, trust is dangerous implying 
the possible betrayal of trust. In his comments on Lagerspetz’s 
book titled Trust: The Tacit Demand, Lahno [24] describes the 
author’s view on trust as a moral relationship in human society. 
Langerspetz believes that investigations of trust reveal that 
’’human individuals, their beliefs, desires and actions are only 
intelligible against the background of existing social practices 
and social ties” [24]. This implies that depending on the nature 
of personal relationships between a trustor and a trustee (i.e., 
moral relationship between them), trustful actions or betrayal 
can occur. 

Trust in psychology: According to the Wikipedia definition 
of trust in psychology [25], trust starts from the birth of the 
child. As the child grows older, trust also grows stronger. 
However, the root of trust derives from the relationship be¬ 
tween mother (or caregiver) of the child since the strength of 
the family relies on trust, if the child is raised in a family 
which is very accepting and loving, the child also returns 
those feelings to others by trusting them. But if trust is lost, 
it is hard to regain it. In this sense, trust in psychology 
emphasizes the cognitive process that human beings learn 
trust from their experiences. Deutsch [26] defines trust as the 
confidence that one will find what is desired from another 
rather than what is feared. An individual may be said to 
have trust in the occurrence of an event if he expects its 
occurrence and his expectation leads to the behavior which 
he perceives to have greater negative consequences if the 
expectation is not confirmed than positive consequences if it is 
confirmed. In addition, Hardin [27] and Rotter [28] observed 
in their experiments that past experience may strikingly affect 
later capacity for trust. For example, bad experience with 
people will lower the trust level, leading to fewer trusted 
relationships with people, and thus fewer opportunities for 
mutual gain. Further, they recognized that the gains obtained 
by having high trust relationships exceed the loss by having 
low trust relationships. For instance, high trustors are less 
likely to lie or cheat or steal. Also they are less likely to be 
unhappy, conflicted, or unstable, and sought by more friends. 
Even though high trustors are deceived more often in novel 
situations, low trustors are also fooled equally by distrusting 
trustworthy people, thereby losing the advantages that high 
trustors may have [28]. 

Trust in organizational management In this field, the 
concept of trust is also defined as the extent to which one party 
is willing to count on someone or something with a feeling 
of relative security in spite of possible negative consequences, 
emphasizing the possibility of facing risk [29]. Schoorman 
et al. [30] defined trust as the willingness to take a risk or 
willingness to be vulnerable in the relationship in terms of 


ability, integrity, and benevolence. They also explained that 
trust is not necessarily mutual and is not reciprocal. Trust 
concepts in organizational management can give us insights 
on how to measure trust by investigating methods to measure 
ability, integrity, and benevolence of each networked node, 
as well as on assessing risk. They can also give us insights 
on defining group trust (i.e., between a person and a group or 
between groups) which is important for dynamic communities 
of interest. 

Trust in autonomic computing : As technology becomes 
more complex, fully understanding automation becomes in¬ 
feasible, if not impossible, and trust in automation becomes 
critical, particularly when unexpected situations arise and 
system responses cannot be predicted. Researchers studying 
autonomic computing in industrial systems engineering have 
sought to develop models of trust to understand how trust in 
automation develops and how it may be misplaced. Lee and 
See [31] define trust as the attitude that an agent will help 
accomplish an individual’s goals in a situation with uncertainty 
and vulnerability. In this sense, an agent can be automation or 
another person that actively interacts with the environment on 
behalf of the person. Parasuraman [32] links the level of trust 
with automation reliability stating that ’’Trust often determines 
automation usage. Operators may not use a reliable automated 
system if they believe it to be untrustworthy.” The notion of 
automation reliability as a trust metric is one that is applicable 
in MANETs, where the user’s trust in reliability on technology 
is an important aspect. 

Trust in communications and networking : The concept of 
trust also has been attractive to communication and network 
protocol designers where trust relationships among participat¬ 
ing nodes are critical in building cooperative and collabora¬ 
tive environments to optimize system objectives in terms of 
scalability, reconfigurability, and reliability (i.e., survivability), 
dependability, or security. According to Eschenauer et al. [9], 
trust is defined as ”a set of relations among entities that 
participate in a protocol. These relations are based on the 
evidence generated by the previous interactions of entities 
within a protocol. In general, if the interactions have been 
faithful to the protocol, then trust will accumulate between 
these entities.” Capra [34] proposes to use a human trust 
model based on human interactions in a trust model for fully 
distributed network environments such as MANETs. Capra 
defines trust as the degree of a belief about the behavior of 
other entities (or agents). Li and Singhal [35] define trust 
as the belief that an entity is capable of performing reliably, 
dependably, and securely in a particular case; hence, different 
levels of trust exist in different contexts. For example, Alice 
may trust her physician to give her advice on her health 
concerns but may not trust her physician’s advice on fixing 
her car. Aivaloglou et al. [36] describe trust as the quantified 
belief of a trustor regarding competence, honesty, security, and 
dependability of a trustee in a specific context. 

Recently, researchers have recognized the importance of 
social networks in building trust relationships among entities. 
Golbeck [37] [38] [39] introduces the concept of social trust 
by suggesting the use of social networks as a bridge to 
build trust relationships among entities. Golbeck proposes the 
application of a trust concept derived from a sociological 
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Fig. 2. Trust level [42], 

viewpoint to computer science, and describes trust as a well- 
defined descriptor of security and encryption as a metric to 
reflect security goals. Wong and Sycara [40] introduce security 
mechanisms to establish trust in multi-agent systems. They are 
concerned with both authenticating agents as well as ensuring 
that agents do not misbehave. Trustworthiness emerges from 
the security features in their system. 

From the definitions of trust derived from various fields 
as reviewed above, we can construct a trust metric having 
the following characteristics: (1) trust should be established 
based on potential risks; (2) trust should be context-dependent; 
(3) trust should be based on each party’s own interest (e.g., 
selfishness); (4) trust is learned (i.e., a cognitive process); and 
(5) trust may represent system reliability. 

B. Trust, Trustworthiness, and Risk 

In the literature, the terms trust and trustworthiness seem to 
be used interchangeably without clear distinction. Josang et al. 
[41] clarified the difference between trust and trustworthiness 
based on definitions provided by Gambetta [15]. Level of 
trust is defined as the belief probability varying from 0 
(complete distrust) to 1 (complete trust) [41]. In this sense, 
trustworthiness is a measure of the actual probability that the 
trustees will behave as expected. Solhaug et al. [42] define 
trustworthiness as the objective probability that the trustee 
performs a particular action on which the interests of the 
trustor depend. 

Figure 2 [42] explains how trust (i.e., subjective probability 
of trust level) and trustworthiness (i.e., objective probability of 
trust level) can differ and how the difference affects the level 
of risk the trustor needs to take. The diagonal dashed line is 
assumed to be marks of well-founded trust in which trust is 
equivalent to trustworthiness. 

Depending on the extent to which the trustor is ignorant 
about the difference between the believed (i.e., trust) and the 
actual (i.e., trustworthiness) probability, there is a miscalcu¬ 
lation of the involved risk. That is, the subjective aspect of 
trust results in incorrect risk estimation and improper risk 
management accordingly. Figure 2 shows the cases in which 
the probability is miscalculated. In the area below the diagonal 
line, there is misplaced trust to various degrees that the 
perceived trust is higher than the actual trustworthiness. Even 


though risk is an intrinsic characteristic of trust even in well- 
founded trust, misplaced trust increases risk and thus enhances 
the chance of deceit as well, as shown in the example marked 
with b in Figure 2. On the other hand, when the perceived 
trust is lower than the actual trustworthiness as shown in the 
example marked with a , the trustee is distrusted more than 
warranted. In this case, the trustor may lose potentially good 
opportunities to cooperate with partners with high trustwor¬ 
thiness. 

From the above discussions, we can conclude that careful 
risk estimation is closely linked with building accurate trust 
relations among participating entities in networks. However, 
Josang et al. [41] argue that objective trust may not be appli¬ 
cable to decision making in real situations. They define two 
interesting types of trust: 1) a context independent reliability 
trust which measures the perceived reliability by another party 
regardless of the situations which the trustor might face by 
recognizing possible risk; 2) decision trust as ’’the extent 
to which a given party is willing to depend on something 
or somebody in a given situation with a feeling of relative 
security even though negative consequences are possible.” 
Decision trust deals with components such as utility and risk 
attitude. As an example, one may not trust an old rope for 
climbing down from the 3rd floor of a building during a fire 
exercise (i.e., reliability trust) while trusting the rope in a real 
fire (i.e., decision trust). 

The relationship between trust and risk has been investi¬ 
gated in [41] [42]. Figure 3 shows an example of three different 
risk values: low, medium, and high. The value of risk is low 
for all trust values when the stake is close to zero. Similarly, 
if the stake is too high, risk is regarded as high regardless of 
the estimated trust value. Risk is generally low when the trust 
value is high. However, the risk value should be determined 
based on the value at stake (e.g., risk probability) since as 
shown in Figure 3, high risk exists even for the case of trust 
value = 1. Also important are the aspects (or probability) of 
opportunity and prospect (or the positive consequence of an 
opportunity) [41] [42]. To buy rubber is to do risky business, 
but it also gives the opportunity of selling refined products 
with net profit. The purchaser of rubber should estimate her/his 
acceptable risk level in terms of the calculated prospects. 
Josang et al. [41] and Solhaug et al. [42] conclude that trust 
is generally neither proportional nor inversely proportional to 
risk. 

Some researchers have commented that trust and uncertainty 
are intimately linked - trust is a mechanism to cope with 
uncertainty. The level of uncertainty in the information used 
as trust evidence will also considerably influence the accuracy 
of trust evaluation [43]. 

C. Trust Properties in MANETs 

Due to the unique characteristics of MANET environments 
and the inherent unreliability of the wireless channel, the 
concept of trust in MANETs should be carefully defined. 
The main properties of trust in MANET environments can 
be summarized as follows (see Figure 4): 

First, trust is dynamic , not static. Trust establishment in 
MANETs should be based on temporally and spatially local 
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Fig. 3. Risk and trust [41]. 
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Fig. 4. Trust properties in MANETs. 


information: due to node mobility or failure, information is 
typically incomplete and can change rapidly [8] [32]. Adams 
et al. [44] point out that in order to capture the dynamicity of 
trust, trust should be expressed as a continuous variable, rather 
than as a binary or even discrete-valued entity. A continuous 
valued variable can represent uncertainty better than a binary 
variable. 

Second, trust is subjective [45]. In MANET environments, 
a trustor node may determine a different level of trust against 
the same trustee node due to different experiences with the 
node derived from a dynamically changing network topology. 

Third, trust is not necessarily transitive [46]. For example, 
if A trusts B, and B trusts C, it does not guarantee A trusts C. 
In order to use the transitivity of trust between two entities to 
a third party, a trustor should maintain two types of trust: trust 
in a trustee and trust in the trustee’s recommendation of the 
third party. For example, Alice may trust Bob about movies, 
but not trust him at all to recommend other people whose 
opinion about movies is worth considering or not trust other 
people that Bob recommended as much as she trusts Bob. 

Fourth, trust is asymmetric , not necessarily reciprocal [44]. 
In heterogeneous MANETs, nodes with higher capability (e.g., 
more energy or computational power) may not trust nodes 
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with lower capability at the same level that nodes with lower 
capability trust nodes with higher capability. As a typical 
example in organizational management, a supervisor tends to 
trust an employee less than the employee trusts the supervisor. 

Fifth, trust is context-dependent [33]. For example, A may 
trust B as a wine expert but not as a car fixer. Similarly in 
MANETs depending on the given task, different types of trust 
(e.g., trust in computational power or trust in unselfishness, 
trust in forwarding versus trust in reporting) are required. 

Figure 5 shows how several trust properties are considered 
in the literature. Dynamicity and weighted transitivity are 
most often considered. However, we notice that some existing 
work does not even consider trust properly; some represent 
trust as a discrete variable, while others assume that trust 
is symmetric or completely transitive. As such they do not 
capture characteristics of trust in a MANET. Further, we could 
not find any prior work that comprehensively considers all five 
properties of trust shown in Figure 5. Note that Figure 5 is 
based on 36 papers and each work may consider multiple trust 
properties. 

In order to properly take into account these unique charac¬ 
teristics of trust in MANETs as described above, any trust- 
based framework for MANETs should consider the following 
as well: 

First, a decision procedure to determine the trust of an entity 
should be fully distributed based on cooperative evaluation 
with uncertain and incomplete evidence, since one cannot 
rely on a trusted third party such as a trusted centralized 
certificate authority to take care of trust management as in 
wired networks [8] [9] [34]. 

Second, trust should be determined in a highly customizable 
way (e.g., flexible to membership changes and to deployment 
scenarios) without causing disruption to the device computa¬ 
tion and communication resources while capturing the various 
and complicated natural components of an individual’s trust 
into a network model [34] [47]. 

Third, a trust decision framework should not assume that 
all nodes are cooperative [34]. In resource-restricted environ¬ 
ments, selfishness is likely to be prevalent over cooperation, for 
example, in order to save battery life or computational power. 
Thomas et al. [48] discuss the tradeoff between selfishness 
and altruism of participating nodes in MANETs in terms of 
prolonging system lifetime (e.g., with system lifetime defined 
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as the time to a node’s death due to energy exhaustion) versus 
reducing selfish behaviors to enhance system throughput. 

Finally, trust should be established in a self-organized 
reconfigurable way in order not to be disrupted by the dy¬ 
namics of MANET environments [8] [48]. In addition to the 
characteristics mentioned above, trust-based frameworks for 
MANETs should consider the tradeoff issues between security 
and performance including reliability, fault tolerance, scalabil¬ 
ity, and energy consumption where resources are restricted but 
security vulnerability is relatively high. 

III. Classifications, Potential Attacks, and 
Metrics for MANET Trust Management 

This section discusses classifications, attacks and perfor¬ 
mance metrics for MANET trust management. Before review¬ 
ing the literature, we would like to clarify some terminologies 
that have been used interchangeably but sometimes confus¬ 
ingly in the context of trust management. 

In general, the term trust management is interchangeably 
used with the term reputation management [35]. However, 
there is a slight difference between trust and reputation. 
According to Liu et al. [49], trust is active while reputation is 
passive. That is, trust is a node’s belief in the trust qualities 
of a peer, thus being extended from a node to its peer. 
Reputation is the perception that peers form about a node. 
Further, Ruhomaa et al. [10] distinguish trust from reputation, 
noting that trust puts an emphasis on risk and incentives 
while reputation focuses on a perception that a party creates 
through past actions about its intentions in the context of the 
norms effective within a community. Also, recommendation 
is frequently used as a way to measure trust or reputation. 
Recommendation is simply an attempt at communicating a 
party’s reputation from one community context to another 
[45] [10]. 

A working definition of trust for Internet applications, and 
a survey of trust management schemes for such applications 
may be found in [12]. 

In most of the literature, reputation management is re¬ 
garded as part of trust management. Further, the terms trust 
management and trust establishment are also interchangeably 
used. To clarify these two terms, according to Aivaloglou et 
al. [36], trust establishment is a process to deal with the 
representation, evaluation, maintenance, and distribution of 
trust among nodes. 

Trust management deals with problems such as the formu¬ 
lation of evaluation rules and policies, representation of trust 
evidence, and evaluation and management of trust relation¬ 
ships among nodes. As Figure 6 explains, trust establishment 
is one of several trust management tasks. 

A. Classifications 

According to Solhaug et al. [42], trust management is a 
special case of risk management with a particular emphasis 
on authentication of entities under uncertainty and decision 
making on cooperation with unknown entities. However, the 
application of trust management has been extended from 
authentication to various aspects of communications and 
networking, including secure routing for isolating malicious 


or selfish nodes, intrusion detection, key management, ac¬ 
cess control, and other decision making mechanisms. Trust 
management includes trust establishment (i.e., collection of 
appropriate trust evidence, trust generation, trust distribution, 
trust discovery, and evaluation of trust evidence), trust update, 
and trust revocation [50] [42]. This section surveys popularly 
used classifications of trust management (or establishment). 

Li et al. [51] and Li et al. [52] classify trust management 
as reputation-based framework and trust establishment frame¬ 
work. A reputation-based framework uses direct observations 
and second-hand information distributed among nodes in a 
network to evaluate a node. A trust establishment framework 
evaluates neighboring nodes based on direct observations 
while trust relations between two nodes without prior direct 
interactions are built through a combination of opinions from 
intermediate nodes. 

Yonfang [53] suggests two different approaches to evaluate 
trust: policy-based trust management and reputation-based 
trust management. Policy-based trust management is based 
on strong and objective security schemes such as logical rules 
and verifiable properties encoded in signed credentials for 
access control of users to resources. In addition, the access 
decision is usually on the basis of mechanisms having a well- 
defined trust management language that has strong verification 
and proof support. Such a policy-based trust management 
approach usually makes a binary decision according to which 
the requester is trusted or not, and accordingly the access 
request is allowed or not. Due to the binary nature of trust 
evaluation, policy-based trust management has less flexibility. 
Furthermore, the availability of (or access to) trusted certificate 
authorities (CA) cannot always be guaranteed, particularly 
for distributed systems such as MANETs. On the other 
hand, reputation-based trust management utilizes numerical 
and computational mechanisms to evaluate trust. Typically, in 
such a system, trust is calculated by collecting, aggregating, 
and disseminating reputation among the entities. 

According to Li and Singhal [35], trust management 
can be classified as evidence-based trust management and 
monitoring-based trust management. Evidence-based trust 
management considers anything that proves trust relationships 
among nodes: these could include public key, address, identity, 
or any evidence that any node can generate for itself or other 
nodes through a challenge and response process. Monitoring- 
based trust management rates the trust level of each partici¬ 
pating node based on direct information (e.g., observing the 
benign or malicious behaviors of neighboring nodes, such 
as packet dropping, and packet flooding leading to excessive 
resource consumption in the network, or denial of service at¬ 
tacks) as well as indirect information (e.g., reputation ratings, 
such as recommendations forwarded from other nodes). 

Aivaloglou et al. [36] classify two types of trust establish¬ 
ment frameworks for MANETs: certificate-based framework 
versus behavior-based framework. In the former, mechanisms 
are defined for pre-deployment knowledge of trust relation¬ 
ships within the network, using certificates which are dis¬ 
tributed, maintained and managed, either independently or 
cooperatively by the nodes. Trust decisions can be made based 
on a valid certificate that proves trustworthiness of the target 
node by a certificate authority or by other nodes that the issuer 
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trusts. In behavior-based framework, each node continuously 
monitors behaviors of its neighboring nodes in order to 
evaluate trust. The behavior-based framework is a reactive 
approach, operating under the assumption that the identities of 
nodes in the network are ensured by preloaded authentication 
mechanisms. For example, if a node uses network resources 
in an unauthorized way, it will be regarded as a selfish or 
malicious node, and will finally be isolated from other nodes. 

Aivaloglou et al. [36] also classify trust establishment 
schemes in terms of the type of architectures used: hierarchi¬ 
cal framework versus distributed framework. In the former, a 
hierarchy exists among the nodes based on their capabilities 
or levels of trust. In this framework, centralized certificate au¬ 
thorities or trusted third parties are usually provided for on-line 
or off-line evidence. Such a centralized infrastructure does not 
exist in a distributed framework; hence, each node has some, 
possibly equal, responsibility for acquiring, maintaining, and 
distributing trust evidence. 

Even though reputation management is part of trust man¬ 
agement, many researchers further classify reputation man¬ 
agement schemes. Adams et al. [44] propose three types of 
reputation systems: positive reputation, negative reputation, 
and a combination of the two. Positive reputation systems only 
consider observations or feedback of the positive behaviors of 
a node. Negative reputation systems only record complaints 
or observations of the negative behaviors of a node. Peers are 
assumed to be trusted and so feedback on behaviors is used 
to negatively reflect a node’s reputation. To complement the 
drawbacks of these mechanisms, hybrid reputation systems 
have been proposed [53]. For more information on reputation 
management, the readers may refer to [11]. 

B. Potential Attacks 

It is important to ensure that a trust management system 
itself should not be easily subverted, attacked or compro¬ 
mised. In this section, we discuss various common attacks 
and describe features important from the viewpoint of trust 
management. A survey of threat models and specific attacks 
on ad hoc routing protocols are described by Argyroudis et 
al. [54] and Djenouri et al. [55]. 

Liu et al. [49] describe the characteristics of attacks in 
MANETs by both the nature of attacks and the type of 
attackers. One classification of attacks is passive attack versus 
active attack. A passive attack occurs when an unauthorized 


party gains access to an asset but does not modify its content. 
Passive attacks include eavesdropping and traffic analysis (e.g., 
traffic flow analysis). Eavesdropping indicates that the attacker 
monitors transmissions of message content. Traffic analysis 
refers to analyzing patterns of data transmission. An active 
attack occurs when an unauthorized party modifies a message, 
data stream, or file. Active attacks usually take the form of 
one of the following four types or combinations: masquerad¬ 
ing (i.e., impersonation attack), replay (i.e., retransmitting 
messages), message modification, and denial-of-service (DoS) 
(leading to excessive resource consumption in the network). 

Yet another way to characterize attacks is based on the 
legitimacy of an entity in a network: insider attack versus 
outsider attack [56]. If an entity is authorized to access system 
resources but employs them in a malicious way (e.g., in a 
way not approved by the authorizer), it is classified as an 
insider attack. More specifically, inside attackers exploit bugs 
in privileged system programs or poorly configured privileges, 
and then they may install backdoors or Trojan horses or 
other such mechanisms to facilitate subsequent acquisition of 
privileged access. On the other hand, an outsider attack is 
initiated by an unauthorized or illegitimate user. They usually 
acquire access to an authorized account and try to perpetrate 
insider attacks. Both attackers may spoof network protocols 
to effectively acquire access to an authorized account. 

Many trust management schemes are devised to detect 
misbehaving nodes, both selfish nodes as well as malicious 
nodes. Specific attack examples are described as follows (the 
list is representative, not exhaustive): 

• Routing loop attacks'. A malicious node may modify 
routing packets in such a way that packets traverse a 
cycle and so do not reach the intended destination [56]. 

• Wormhole attacks : A group of cooperating malicious 
nodes can pretend to connect two distant points in the 
network with a low-latency communication link called a 
wormhole link, causing disruptions in normal traffic load 
and flow [57][58][59]. 

• Blackhole attacks : A malicious node, the so called 
black hole node, may always respond positively to route 
requests even when it does not have proper routing infor¬ 
mation. The black hole can drop all packets forwarded 
to it [60]. 

• Gray hole attacks : A malicious node may selectively drop 
packets [61], as a special case of a black hole attack. For 
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example, the malicious node may forward routing packets 
but not data packets. Similarly, a sinkhole attacker attracts 
nodes to route through it and then selectively routes 
packets [49]. 

• DoS attacks'. A malicious node may block the normal 
use or management of communications facilities, for ex¬ 
ample, by causing excessive resource consumption [62]. 

• False information or false recommendation : A mali¬ 
cious node may collude and provide false recommen¬ 
dations/information to isolate good nodes while keeping 
malicious nodes connected. In the stacking attack , a 
malicious node keeps complaining about a peer node and 
creates the peer’s negative reputation [44] [63]. 

• Incomplete information : A malicious node may not 
cooperate in providing proper or complete information. 
Usually compromised nodes collude to perform this at¬ 
tack. However, node mobility or link failure, prevalent 
in MANETs, may also result in the same phenomenon 
[8] [34]. 

• Packet modification/insertion: A malicious node may 
modify packets or insert malicious packets such as pack¬ 
ets with incorrect routing information [64]. 

• Newcomer attacks : A malicious node may discard its bad 
reputation or distrust by registering as a new user. The 
malicious node simply leaves the system and joins again 
for trust revocation, flushing out its previous bad history 
and starting to accumulate new trust [65]. 

• Sybil attacks'. A malicious node can use multiple net¬ 
work identities which can affect topology maintenance 
and fault tolerant schemes such as multi-path routing 
[61] [49] [46]. 

• Blackmailing: A malicious node can blackmail another 
node by disseminating false information that another 
node is malicious or misbehaving. This can generate 
significant amount of traffic and ultimately disrupt the 
functionality of the entire network [49]. This attack can 
be seen as false accusation plus DoS attacks in the 
sense that false information is disseminated leading to 
a significant amount of resource consumption. 

• Replay attacks'. A malicious node may replay earlier 
transmitted packets. If the packets include data, this 
should not cause trouble, and the receiving node just 
discards erroneous packets. However, if the adversary 
replays route requests, routing table information would 
become erroneous, and old locations and routing infor¬ 
mation might make nodes unreachable [56]. 

• Selective misbehaving attacks : A malicious node behaves 
badly but selectively to other nodes [66]. 

• On-off attacks'. A malicious node may alternatively be¬ 
have well and badly to stay undetected while disrupting 
services [66]. 

• Conflicting behavior attacks : A malicious node may 
behave differently to nodes in different groups to make 
the opinions from the different good groups conflicting, 
and ultimately lead to non-trusted relationships [52]. 

Figure 7 shows various attacks considered in a survey of 
43 papers. Note that the ’’general selfish” category means 
no specific information is given in the work except that it 
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Fig. 7. Attacks considered in existing trust management systems in 
MANETs. 


deals with selfish nodes. Also papers in the ’’general mis¬ 
behaving” category deal with a broad range of misbehaving 
nodes, including malicious and selfish nodes, but do not 
provide detailed information. ’’Packet related” attacks include 
packet dropping, packet modification, packet insertion, and 
selective packet forwarding. ’’Identity related” attacks include 
impersonation, masquerading, and Sybil attacks. Except for 
the ’’general selfish” and ’’general misbehaving” categories, 
we notice that ’’false information” (e.g., including false rec¬ 
ommendation or reputation) and ’’packet related” attacks are 
dominantly considered in the literature on trust management 
schemes for MANETs. Figure 7 illustrates that most of the 
attacks considered in the literature on trust management are 
general attacks often targeted at other aspects of MANETs. 
Hence, the trust evaluation engine should be robust and 
degrade gracefully if some information or evidence does not 
provide a certain level of trust based on partial or potentially 
corrupted information. 


C. Metrics for MANET Trust Management 

Although many trust management schemes have been pro¬ 
posed to evaluate trust values, no work clearly addresses what 
should be measured to evaluate network trust. Liu et al. 
[49] defined trust in their model as reliability, timeliness, and 
integrity of message delivery to the intended next-hop. Also 
most trust-based protocols for secure routing calculated trust 
values based on the characteristics of nodes behaving properly 
at the network layer. Trust measurement can be application- 
dependent and will be different based on the design goals of 
proposed schemes. 

Based on 31 papers, Figure 8 shows various performance 
metrics that have been used to evaluate trust management 
schemes for MANETs. Note that a single work may use 
multiple performance metrics. Figure 8 shows standard system 
performance metrics typically used to evaluate trust manage¬ 
ment systems; these metrics include overhead (e.g., control 
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Fig. 8. Metrics considered by MANET trust management systems. 

packet overheads), throughput, goodput, packet dropping rate, 
and delay. ’’Route usage” refers to the number of routes 
selected particularly when the purpose is for secure routing. 
’’Trust level” is a recently used system metric. Example 
metrics using the trust level include confidence level of 
the trust value, trustworthiness, opinion values about other 
nodes, and trust level per session. ’’Others” indicates metrics 
that consider system tolerance based on incorrect reputation 
threshold, availability, convergence time to reach steady state 
in trustworthiness of all participating nodes, and percentage 
of malicious nodes. 

IV. MANET Trust Management Schemes 

This section summarizes trust management schemes that 
have been developed for MANETs. 

We describe trust management schemes based on specific 
design purposes such as secure routing, authentication, intru¬ 
sion detection, access control (authorization), and key manage¬ 
ment. Further, we also describe existing general frameworks 
for trust (or reputation) evidence distribution and evaluation. 
Figure 9 summarizes 45 trust management schemes proposed 
for MANETs during 2000-2009 based on their design pur¬ 
poses. Note that under each research category, we will survey 
existing works in chronological order. 

A. Secure Routing 

Most reputation-based trust management schemes are de¬ 
vised for collaborative secure routing by detecting misbehav¬ 
ing nodes, both selfish and malicious ones. Marti et al. [67] 
proposed a reputation-based trust management scheme that 
consists of a watchdog that monitors node behaviors and a 
pathrater that collects reputation and takes response actions 
(e.g., isolating misbehaving nodes as a result of misbehavior 
detection). This work is an initiative to dynamically incorpo¬ 
rate direct observations into trust values for secure routing. It 
extends DSR (Dynamic Source Routing) but trust evaluation 
is based only on direct observations. 

Buchegger et al. [68] initiated a new design to develop 
a routing protocol by introducing a ’’trust manager” in their 
scheme. They determined trust levels based on self-monitored 


information while employing reputation collected from both 
direct and indirect observations and experiences. They did not 
show any experimental results, but pose several interesting 
questions such as what is a sustainable relationship between 
the total number of nodes in the network, the maximum 
number of malicious nodes the system can tolerate, and the 
minimum number of friends per node needed to achieve 
high tolerance, and a prescribed level of trust. Buchegger et 
al. [69] also developed a reputation-based trust management 
scheme called CONFIDANT (Cooperation Of Nodes-Fairness 
In Dynamic Ad-hoc NeTworks) based on both direct and 
indirect observations to detect misbehaving nodes. The unique 
feature in this work is an incentive mechanism for altruistic 
nodes to be paid as a result of cooperation. 

Paul and Westhoff [70] proposed a context-aware mecha¬ 
nism for detecting selfish nodes by extending DSR with a 
context-aware inference scheme to punish the accused and the 
malicious accuser. However, the use of digital signatures to 
disseminate information about the accused and the malicious 
accuser may not be viable in a resource-constrained MANET 
environment. 

Michiardi et al. [71] proposed CORE (Collaborative REp- 
utation) that has a monitoring mechanism complemented by 
a reputation functionality that differentiates between direct 
reputation, indirect reputation, and functional reputation (task- 
specific behavior). The proposed protocol is developed to 
make decisions about cooperation or gradual isolation of a 
node. A unique characteristic of this mechanism is that it 
exchanges only positive reputation information. However, this 
may limit its reliance on positive reports without the facility 
to submit negative feedback. 

He et al. [72] proposed a reputation-based trust management 
scheme using an incentive mechanism, called SORI (Secure 
and Objective Reputation-based Incentive). This scheme en¬ 
courages packet forwarding and discourages selfish behav¬ 
iors based on quantified objective measures and reputation 
propagation by a one-way hash chain based authentication. 
The performance of this scheme in the presence of malicious 
nodes, as may be expected in a hostile environment, has not 
been investigated. 

Nekkanti and Lee [73] extended AODV (Ad hoc On- 
demand Distance Vector) using trust factor and security level 
at each node. Their approach deals differently with each route 
request based on the node’s trust factor and security level. 
In a typical scheme, routing information for every request 
would be encrypted leading to large overheads; they propose 
to use different levels of encryption based on the trust factor 
of a node, thus reducing overhead. This approach adjusts 
the security level based on the recognized hostility level and 
hence can conserve resources; however, the approach does not 
treat evaluation of trust itself. Li et al. [74] also extended 
AODV and adopted a trust model to guard against malicious 
behaviors of nodes at the network layer. They represented 
trust as opinion stemming from subjective logic. The opinion 
reflects the characteristics of trust in MANETs, particularly 
dynamicity. The key feature is to consider system performance 
aspects by dealing with each query based on its level of 
trust. Depending on the level of trust of nodes involved 
in the query, there is no need for a node to request and 
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verify certificates all the time, thereby leading to significant 
reduction of computation and communication overhead. This 
work advances trust management by considering a generic 
trust management framework for MANETs. 

Pisinou et al. [75] devised a secure AODV-based routing 
protocol for multi-hop ad hoc networks for discovering a 
secure end-to-end route free of any compromised nodes. Their 
trust-based routing protocol calculates trust values based only 
on direct observations, assuming that trust is transitive. As 
a continuation of [68], Buchegger et al. [76] also proposed 
a fully distributed reputation system in order to cope with 
false information propagation. The proposed design maintains 
a reputation and trust rating system about individual nodes 
by designing a modified Bayesian approach. Recognizing the 
dynamic nature of trust and reputation, the authors introduced 
reevaluation and reputation fading as well as redemption 
mechanisms. Nevertheless, no other characteristics of trust are 
addressed except for dynamicity. 

Ghosh et al. [77] enhanced trust management by consider¬ 
ing the confidence level of trust. Their use of the confidence 
level as a weight on the computed trust value and the method 
for calculating trust in a fully distributed way provide a 
general framework that can be applied to non-trust-aware 
routing protocols. In [77], SORI [72] is extended to alleviate 
the problem of selfish nodes, by considering the number of 
forwarding packets to evaluate the confidence level. 

Wang et al. [78] proposed a mechanism to distinguish 
selfish peers from cooperative ones based solely on local 
observations of AODV routing protocol behaviors. They use a 
finite state machine model of locally observed AODV actions 
to construct a statistical description of each peer’s behavior. In 
order to distinguish between selfish and cooperative peers, a 
series of well-known statistical tests are applied to features 
obtained from the observed AODV actions. An interesting 
extension of this work would be to consider various patterns 
of node mobility which can give additional insights. 

Zouridaki et al. [79] proposed a trust establishment mech¬ 
anism for MANETs called Herms to improve the reliability 
of packet forwarding over multi-hop routes in the presence 
of potentially malicious nodes. Essentially, direct observations 
are used to evaluate opinions about others. Also, confidence 
level is used as a weight to evaluate trust of other nodes based 
on a Bayesian approach. They also introduced a windowing 
scheme to systematically expire old data to maintain accuracy 
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of the opinion metric in the face of dynamics. However, this 
scheme is vulnerable to attacks that can exploit the windowing 
scheme to disseminate false information to accuse good nodes 
and to keep bad nodes in the system (such as badmouthing 
attacks). 

As an extension, Zouridaki et al. [80] employed both 
first-hand trust information based on direct observations and 
second-hand trust information forwarded from neighboring 
nodes about non-neighboring nodes. This trust establishment 
scheme can cope with more attacks, including propagation of 
false recommendations or information, identifying bad nodes 
among neighboring nodes, colluding attacks, replay attacks, 
and duplicate attacks. It is noteworthy that they used only 
security related metrics to evaluate their scheme, such as 
trustworthiness and the percentage of nodes recognized as bad. 

Pirzada et al. [81] proposed and examined the efficacy 
of trust-based reactive routing protocols in the presence of 
attacks. This work only considers first hand information to 
evaluate other nodes’ trust values. Thus, trust evaluation is 
restricted to direct neighboring nodes. 

Sun et al. [46] proposed trust modeling and evaluation 
methods for secure ad hoc routing and malicious node de¬ 
tection. The unique part of their design is to consider trust as 
a measure of uncertainty that can be calculated using entropy. 
In their definition, trust is a continuous variable, and does not 
need to be transitive, thus capturing some of the characteristics 
of trust in MANETs. However, this work considers packet 
dropping as the only component of direct observations to 
evaluate trust. 

Abusalah et al. [82] proposed a trust-aware routing protocol 
(TARP) and developed a trust metric based on six trust com¬ 
ponents including software configuration, hardware configura¬ 
tion, battery power, credit history, exposure and organizational 
hierarchy. However, no consideration was given to trust decay 
over time and space to reflect uncertainty due to dynamics and 
incomplete information in MANET environments. 

Sen et al. [83] proposed a trust-based mechanism to de¬ 
tect malicious packet dropping nodes based on reputation of 
neighboring nodes, and take into account the decay of trust 
over time. This work assumes that a pair of public/private keys 
can be preloaded to prevent identity-related attacks. However, 
this may not be scalable for a large network. 

Soltanali et al. [84] proposed a distributed mechanism to 
deal with selfish nodes as well as to encourage cooperation in 
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MANETs based on the combination of reputation-based and 
currency-based incentive mechanism mitigating their defects 
and improving their advantages. Compared to existing works, 
this work considers more aspects of trust such as dynamicity, 
weighted transitivity, and subjectivity. However, it used only 
packet forwarding behaviors to evaluate a node’s trust and 
standard performance metrics to evaluate the proposed trust 
scheme. 

Balakrishnnan et al. [85] developed a trust model to 
strengthen the security of MANETs and to deal with the issues 
associated with recommendations. Their model utilizes only 
trusted routes for communication, and isolates malicious nodes 
based on the evidence obtained from direct interactions and 
recommendations. Their protocol is described as robust to the 
recommender’s bias, honest-elicitation, and free-riding. This 
work uniquely considered a context-dependency characteristic 
of trust in extending DSR. 

Li et al. [52] stated that using only a reputation-based trust 
framework gives only an incomplete partial solution for trust 
management. They proposed an objective trust management 
framework (OTMF) for MANETs based on both direct and 
indirect information for reputation management and showed 
the effectiveness of OTMF. This work used the term ”ob- 
jective” trust to refer to trust evaluated based on second¬ 
hand information. However, this work did not consider node 
collusion in obtaining second-hand information, which may 
lead to incorrect recommendations. 

Mundinger and Boudec [86] were the first to analyze the 
robustness of a reputation system based on a deviation test. 
Using a mean-field approach in their stochastic model, they 
showed that liars have no impact unless the number of liars 
exceeds a certain threshold (a phase transition). They provided 
precise formulas for the critical values and guidelines for an 
optimal choice of parameters. This work is unique in that it 
evaluates a system’s tolerance to untrusted nodes; however, the 
reputation evaluation is based only on the ’’fake” information. 

Moe et al. [87] proposed a trust-based routing protocol as 
an extension of DSR based on an incentive mechanism that 
enforces cooperation among nodes and reduces the benefits 
that selfish nodes can enjoy (e.g., saving resources by selec¬ 
tively dropping packets). This work is unique in that they used 
a hidden Markov model (HMM) to quantitatively measure the 
trustworthiness of nodes. In this work, selfish nodes are benign 
and selectively drop packets. Performance characteristics of 
the protocol when malicious nodes perform active attacks 
such as packet modifications, identity attacks, etc., need to 
be investigated further. 

In quorum or threshold schemes, a node must successfully 
interact with at least k of n distributed trusted authority (TA) 
nodes. Finding k such nodes can be resource intensive. Reidt 
et al. [88] prioritize the TA nodes and find a route to connect 
to k desirable TA nodes so as to minimize a performance 
metric such as overhead, taking into account reliability and 
energy consumption of individual nodes. Significant savings 
over a standard system were shown. An interesting aspect, not 
considered yet, would be to incorporate trustworthiness into 
the TA selection and routing scheme. 

Ayachi et al. [89] formalized implicit trust relations in 
AODV and demonstrated that a node can utilize these trust 


relations to isolate malicious nodes for secure routing. Nodes 
overhear neighbors’ transmissions from which they can build 
a neighbor routing table and check for deviation from nor¬ 
mal behaviors for AODV. This scheme can detect malicious 
behaviors such as message replication, message forgery and 
some instances of message modification. However, it is not 
amenable to incorporation of other trust metric components, 
such as intimacy and competence but monitored behaviors 
could feed into a trust evaluation scheme. 

Adnane et al. [90] proposed trust-based countermeasures 
to isolate malicious nodes extending OLSR (Optimized Link 
State Routing). Their protocol provides secure routing paths 
by identifying malicious nodes. The focus of the protocol is 
to prevent usurpation of node identities. Performance analysis 
under other types of attacks remains to be investigated. 

Although many researchers have developed secure routing 
protocols using trust, most of the approaches have focused on 
monitoring routing behaviors and the evaluation of trust has 
been in the context of communication networks. Further steps 
should be taken to refine issues such as (1) how to quantify 
trust in a MANET node; (2) how to employ (a continuous¬ 
valued) trust in a routing decision; and (3) how to develop a 
composite trust metric incorporating task performance goals, 
taking into account the social aspects of a MANET node. 

B. Authentication 

There have been efforts to establish trust relationships to 
ensure authentication in MANETs. Weimerskirch et al. [91] 
developed a trust model based on human behavior, noting that 
society can be properly considered as an ad hoc network. 
They used recommendations from a distributed trust model to 
construct trust relationships and extended it by a request for 
recommendations. Based on models derived from observations 
of human society, recommendations are used to calculate trust, 
with weights based on the distance of relationships. Their 
definition does not assume symmetry or complete transitivity, 
thus capturing essential features of trust in MANETs. The 
assumption of low-value transactions does not require any 
evidence-based mechanism to ensure trust such as authen¬ 
tications using public/private keys. Consequently, it is not 
applicable to systems where hostility may be high, or where 
consequences of misplaced trust can be severe. 

Verma et al. [92] presented an overview of a trust negoti¬ 
ation scheme using DSR and ZRP (Zone Routing Protocol). 
Their scheme consists of two components. The peer-to-peer 
component deals with secure communications with neighbors 
in a lightweight manner. The heavyweight remote component 
performs trust negotiation and establishes secure end-to-end 
communication. The main goal of this work is to add ro¬ 
bustness in the process of trust negotiation, rather than trust 
evaluation. 

Pirzada and McDonald [93] proposed a trust-based com¬ 
munication model that, based on a notion of a belief\ pro¬ 
vides a dynamic measure of reliability and trustworthiness 
in MANETs. The merit of this work is to incorporate utility 
as general trust and time as situational trust into the overall 
trust metric to evaluate an agent in the network. However, the 
situational trust considered is limited to monitoring dynamics 
of packet forwarding behaviors. 
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Davis [47] proposed a reliable and structured hierarchical 
model for trust management in MANETs that is robust to 
malicious accusation exploits. The scheme deals with explicit 
revocation of certificates in a distributed way, eliminating the 
case in which revoked certificates can be accepted as valid., 
This work assumes that the initial certificates and public keys 
of all nodes are distributed by a centralized trust authority to 
each node before the network is deployed which may not be 
scalable in a large scale MANET. The paper does not discuss 
the issue of false positives which can lead to continual eviction 
of nodes, and eventually loss of network connectivity. To 
counteract this, dynamic reissue of certificates may be needed 
which may incur extra overhead. 

Ngai and Lyu [94] proposed a secure public key authentica¬ 
tion service based on their trust model to prevent propagation 
of false public keys in the presence of malicious nodes. 
Trust is evaluated based on direct monitoring as well as 
recommendation. However, this work does not consider group 
membership changes, the distance from the evaluator, and their 
effect on the performance of their trust management scheme. 

In summary, there has been quite a bit of work on using 
trust for authentication. However, as in the case of trust-based 
secure routing, the models and protocols used are based solely 
on monitoring packet forwarding behaviors. 

C. Intrusion Detection 

Trust can be used as a basis for developing an intrusion 
detection system (IDS). Also, IDS itself can help nodes 
measure trust of other nodes when they cooperate with each 
other to detect malicious nodes. Albers et al. [95] proposed 
a general architecture for an intrusion detection system (IDS) 
called a Local IDS (LIDS) such that intrusion detection can 
be performed locally among trustworthy participating nodes. 
Here, trust is used to detect intrusions in the system. In Ahmed 
et al. [96], IDS provides audit and monitoring capabilities 
that offer local security to a node and helps perceive the 
specific trust levels of other nodes. Hence, evaluating trust and 
identifying intrusions may not be totally separated processes. 

D. Access Control 

Trust also can be applied in determining whether or not to 
grant access to certain resources or rights. Gray et al. [97] 
integrated trust-based admission control with standard role- 
based access control. By doing this, an access control decision 
is effectively made without being affected by incomplete infor¬ 
mation collected in MANETs. A simple distributed blackjack 
card game application is described, in which the trust-based 
admission control system is used to assign roles to users based 
on their trust-based admission rights. It is not clear how the 
approach can be extended to a general framework applicable 
to MANETs. 

Luo et al. [98] presented a ubiquitous and robust access 
control solution (URSA) for MANETs based on a localized 
group trust model so that only well behaving nodes will have 
access rights to network resources. Their localized group trust 
model for MANETs is based on threshold cryptography: a 
node is globally trusted only if it is individually trusted by 
any k trusted nodes where k is a system-wide trust threshold. 


This work assumes that the node density is large enough so 
that any node can find k trusted nodes, perhaps by moving 
to another location. Interesting extensions of the work include 
consideration of mobility models other than random waypoint, 
and trust evaluation under high node mobility situations. 

Adams and Davis [17] presented a decentralized access 
control system implementing sociological trust constructs in 
a quantitative system to evaluate the relationships between 
entities. A distributed, node-centric approach to reputation 
management considers a node’s behavior feedback and gives a 
reputation index that nodes can use to determine the trustwor¬ 
thiness of their peers before establishing trust relationships. 
This work further assessed risk using a Bayesian approach 
to evaluate trust. Interestingly, this work used reputation as 
a weight to evaluate direct observations, which is a different 
approach from most existing works. Extensions of the scheme 
to handle network dynamics would be useful. 

Yunfang [53] proposed an integrated mechanism of policy 
proof and reputation evolution into trust management for 
decision-making on access control with the goal of providing 
firm/objective security as well as social/subjective security. 
However, this work is based on the assumption that trust 
is completely transitive, and it is not clear how a more 
realistic transitivity model can be incorporated into the trust 
management system. 

E. Key Management 

Virendra et al. [99] proposed a trust-based security archi¬ 
tecture for key management in MANETs. This architecture 
aims to establish keys between nodes based on their trust 
relationships, and to build secure distributed control using trust 
as a metric. In their self-organizing trust-based architecture, 
nodes are organized into trust-based clusters called Physical- 
Logical Trust Domains (PLTDs), a group of trusted nodes 
sharing a group key. Nodes can belong to multiple PLTDs. 
The unique part of this work is that it considers the trust level 
of each node in a physical as well as a logical sense, e.g., 
it considers both one-hop nodes as well as previously trusted 
nodes that are not currently one-hop neighboring nodes. The 
significant merit of this work is in formalizing a trust metric 
reflecting trust decay over time and updating trust as dynamics 
of the network change. However, establishing pair-wise keys 
based on pair-wise trust may not be feasible in terms of 
scalability and in the presence of high network dynamics in a 
large MANET. 

Hadjichristofi et al. [63] presented a key management 
framework that provides redundancy and robustness in the 
establishment of Security Association (SA) between pairs of 
nodes. Their proposed key management system (KMS) adopts 
a modified hierarchical Public Key Infrastructure (PKI) model 
where nodes can dynamically take management roles. The 
scheme is designed to provide high service availability based 
on trust-based SA among nodes. However, trust relationships 
are derived solely from certificate chains. Adams et al. [44] 
also extended their prior work [63] with a node-centric rep¬ 
utation management approach that considers feedback about 
a node’s behavior in generating a reputation index to de¬ 
termine the trustworthiness of its peers before establishing 
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IPSec security associations. They considered the decay of trust 
over time using a three-window weighted average. They also 
derived reputation values from past experiences and current 
observations and introduced a rehabilitation mechanism to 
give a second chance to bad nodes. However, no details 
were given on the type of information that should be directly 
observed to derive reputation. 

Li et al. [100] demonstrated an on-demand, fully local¬ 
ized, and hop-by-hop public key management protocol for 
MANETs. In this protocol each node generates its own 
public/private key pairs, issues its certificate to neighboring 
nodes, keeps received certificates in its certificate repository, 
and provides authentication service by adapting to the dynamic 
network topology, without reliance on any centralized server. 
However, only certificate chains are used to derive trust. 

Chang and Kou [101] proposed a Markov chain trust model 
to obtain the trust values (TVs) for 1-hop neighbors. They de¬ 
signed a trust-based hierarchical key management scheme by 
selecting a certificate authority server (CA) and a backup CA 
with the highest TVs. This work gives a rigorous analysis of 
TVs and considers a variety of attacks. However, it computes 
TVs only based on direct observations and does not consider 
trust decay due to using recommendations from remote nodes. 

A survey of key management techniques for network-layer 
security may be found in the work by Hegland et al. [102]. 

In contrast to secure routing that produces an operational 
MANET, authentication, intrusion detection, access control, 
and key management are general trust contexts that also exist 
outside the area of MANETs. In these applications, it is 
useful to abstract out the properties of MANETs and consider 
only the influence of MANETs on any information/evidence 
gathering, aggregation, and other computation, and design a 
trust management scheme that considers influences such as 
the cost/likelihood of obtaining a piece of information in 
computing trust. 

F. Trust Evidence Distribution and Evaluation 

Several trust management schemes have been proposed 
in order to provide a general framework for trust evidence 
distribution or evaluation in MANETs. 

Yan et al. [64] proposed a trust evaluation based security 
solution for data protection, secure routing, and other network 
activities. This trust evaluation model called Personal Trusted 
Bubble (PTB) considers many factors including experience 
statistics, data value (the higher the value of the data, the 
higher is the trust needed from other PTBs to transfer it), 
intrusion black list, reference (reputation/recommendation), 
personal preference, and PTB policy (related to the entire 
network’s security requirements and policy). Interestingly, per¬ 
sonal preference and PTB reflect the subjective characteristic 
of trust in deriving trust values. Yan et al. [64] do not 
validate whether their proposed trust management is correct 
or useful compared to the actual trust levels, say, based on 
trustworthiness in Josang and Solhaug’s terminology. In gen¬ 
eral, validation of trust models is difficult, given the inherent 
subjectivity in the trust metric, but it is also critical. Jiang 
and Baras [103] proposed a trust distribution scheme called 
ABED (Ant-Based trust Evidence Distribution) based on the 
swarm intelligence paradigm, which is highly distributed and 


adaptive to mobility. The swarm intelligence paradigm is 
widely used in dynamic optimization problems (e.g., the trav¬ 
eling salesman problem, routing in communication networks). 
The key principle in swarm interaction is called stigmergy, 
indirect communication through the environment. In ABED, 
’’pheromones” are deposited at nodes by mobile agents called 
’’ants” and provide the mechanism for information exchange 
and interactions. These ’’ants” can identify the optimal path 
toward their food, resembling trust evidence in this case. 
The pheromone regulation process is known to be suitable 
for dynamically changing environments such as MANETs. 
However, no specific attackers are considered to prove the 
robustness of the proposed scheme in the presence of attacks. 

In the continuing work, Baras and Jiang [104] addressed 
distributed trust computation and establishment using random 
graph theory. This work uses the theory of dynamic coop¬ 
erative games and identifies how a phase transition from a 
distrusted state to a trusted state can occur in a dynamic 
MANET. This work is unique in that it describes how phase 
transitions occur in MANETs and how these are related to 
node mobility and network topology in the process of initial 
trust establishment. Trust relationships are ternary (yes, no, 
don’t care) and the emphasis is on understanding steady-state 
behaviors. Incorporating continuous valued trust variables, 
dynamics, and transient behaviors in this framework would 
be useful. 

Theodorakopoulos and Baras [50] proposed a trust evidence 
evaluation scheme for MANETs. The evaluation process is 
modeled as a path problem in a directed graph where vertices 
represent entities and edges represent trust relations. The 
authors employed the theory of Semirings to show how two 
nodes can establish trust relationships without prior direct 
interactions. Their case study uses the PGP web of trust 
to express an example trust model based on Semirings and 
shows that their scheme is robust in the presence of attackers. 
However, their work assumes that trust is transitive. Further, 
trust and confidence values are represented as binary rather 
than continuous values. Even though no centralized trusted 
third party exists, their work makes use of a source node 
as a trusted infrastructure, which introduces vulnerability in 
MANETs. 

Recently, Boukerche and Ren [105] proposed a distributed 
reputation management mechanism called GRE (Generalized 
Reputation Evaluation), using a comprehensive computational 
reputation model. GRE seeks to prevent malicious nodes from 
entering a trusted community. However, no specific attack 
model was addressed. 

Moloney and Weber [106] presented a trust-based security 
system that generates appropriate trust levels based on the 
consideration of the main characteristics of MANETs as 
well as context-awareness. The scheme leverages two existing 
projects at Trinity College, Dublin, called SECURE and Aithe. 
SECURE is used for trust management using a trust engine 
and a risk engine while Aithe collects and manages context 
information forwarded from sensors. It is worthwhile to extend 
this work to consider attacks that can propagate incorrect 
information to generate trust levels. 

Very recently, Cho et al. [107] proposed a trust management 
scheme for group communication systems in MANETs. This 
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work proposed a composite trust metric reflecting various 
aspects of a MANET node such as sociability (i.e., social 
trust) and task performance capability (i.e., QoS trust), and 
investigated the effect of the trust chain length used by a 
node to establish acceptable trust levels through subjective 
trust evaluation. They also discussed the concept of objective 
trust evaluation based on global knowledge as the basis of 
validating subjective trust evaluation. More work remains to 
be done to ascertain feasibility. 

The Appendix summarizes trust management schemes sur¬ 
veyed in this section. In the Appendix, the methodology 
explains how trust evidence is collected and performance 
metrics refer to the metrics used to evaluate various trust 
management schemes. 

V. Future Research Discussion 

It is clear that sooner or later intelligence will be em¬ 
bedded in each node with cognitive functionality, adopting 
recent ideas about cognitive networks in wireless networks 
[108]. Mahmoud [108] defines a cognitive network as having 
a cognitive process that is capable of perceiving current 
network conditions and then planning, deciding, and acting 
on those conditions. Cognitive networks are able to recon¬ 
figure the network infrastructure based on past experiences 
by adapting to continuously changing network behaviors to 
improve scalability (e.g., reducing complexity), survivability 
(e.g., increasing reliability), and QoS (e.g., facilitating co¬ 
operation among nodes) as proactive mechanisms [48] [108]. 
We suggest using this concept of cognitive networks so that 
nodes can adapt to changing network behaviors, such as 
attacker behaviors, degree of hostility, node disconnection due 
to physical environment such as terrain, energy depletion, or 
voluntary disconnection for energy saving. Cognition is more 
than adaptation; it incorporates learning and reasoning. 

Another potentially fruitful research direction is to use 
social relationships in evaluating trust among collaborators in 
a group setting by employing the concept of social networks. 
Golbeck et al. [37][38][39] define a social network as a 
social structure of individuals who may be related directly or 
indirectly to each other in order to pursue common interests. 
Yu et al. [109] and Maheswaran et al. [110] use social 
networks to evaluate the trust value of a node. Examples 
of social networks are strong social relationships including 
colleagues or relatives, membership in the same platoon, and 
loose social relationships including school alumni or friends 
with common interests or membership in coalition activities. 
Social trust may include friendship, honesty, privacy, and 
social reputation or recommendation derived from direct or 
indirect interactions for ’’sociable” purposes. In MANETs, 
metrics used to measure these social trust properties can be 
frequency of communications, malicious or benign behaviors 
(e.g., false accusation or recommendation, impersonation), 
private information revealed, and quality of reputation. The 
notion of social trust is being incorporated into communication 
networks. Trust propagation models, some based on notions of 
social networking, have been proposed in multi-agent systems 
[114] [115] [116]. 

An important and interesting research direction is to con¬ 
struct a composite trust metric based on social trust and 


other trust components representing quality-of-service (QoS) 
to successfully perform tasks to meet both performance and 
trust requirements. We have seen some work in the literature 
moving in this direction. Cho et al. considered honesty and 
intimacy (for social trust), and unselfishness and energy (for 
QoS trust) for trust evaluation [107]. Kohlas et al. [Ill] used 
honesty, competency, reliability, and maliciousness and their 
corresponding negations as trust components to define trust 
relationships. Yin et al. [112] computed composite reputation 
values of peers based on evidences from various domains 
such as customers’ reputation scores or ranks in commercial 
sites or the certified roles in certain organizations with dif¬ 
ferent weights indicating the importance and robustness of 
the reputation computation processes. Boursas and Hommel 
[113] considered QoS aspects such as the visual quality in 
multimedia and commitment in interactions to calculate node 
trust levels in large distributed systems. More work remains 
to be done to understand the best combination of social trust 
versus QoS trust components used to construct the composite 
trust metric, as well as the best weights associated with social 
trust and QoS trust, especially when given application context 
information for critical mission executions in MANETs. 

Not much work has been done in trust management for 
mobile vehicular systems. A trust architecture for vehicular 
networks is proposed in [117] that incorporates a policy 
control model, a proactive trust model, and a social network 
based system, and takes into account dynamics. When the 
environment is volatile, associating trust with data becomes 
even more challenging; a solution is provided in [117] and a 
case study is discussed in the context of vehicular networks. 

The overall qualities of trust in decision making de¬ 
pends on complex interactions between the information, so¬ 
cial/cognitive, and communications networks. Trust metrics 
might be separately defined in each of the networks, but 
the key issue is to elucidate the mapping of qualitative and 
quantitative metrics across the networks, to define an end- 
to-end notion of composite trust, to determine the attributes 
(presumably many others than trust) in the different networks 
that affect this composite metric, and identify those that can 
be controlled and those that cannot [118], especially for trust 
management in a coalition environment [119]. 

We suggest that the following design concepts be considered 
for building MANET trust management systems: 

• A trust metric must reflect the unique properties of trust 
in MANETs, including possibly imperfect transitivity, 
asymmetry, subjectivity, non-binary nature, decay over 
time and space, dynamicity, and context-dependency. 

• A trust metric must incorporate adequate trust compo¬ 
nents (e.g., social trust and QoS trust) capable of reflect¬ 
ing mission difficulty (e.g., high risk upon task failure), 
changing network environments (e.g., lack of bandwidth, 
increasingly hostile environment as attackers’ strength 
increases, high communication load), and conditions of 
participating nodes (e.g., low energy, compromised sta¬ 
tus). 

• A trust management design must support cognitive func¬ 
tionality for each node to achieve adaptability to changing 
network conditions and MANET environments including 
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node density, node mobility patterns, scheduling algo¬ 
rithms, and traffic patterns. 

• A trust management system should be situation specific 
or situation aware [120] [121] [122]. Situational awareness 
includes mission contexts and requirements in terms of 
security, performance and reliability. Depending on the 
required levels of security, performance and/or reliability, 
a different level of trust can be adopted reflecting mission 
contexts and situations. 

• A trust metric must adequately reflect tradeoffs in altru¬ 
ism versus selfishness, trust versus reliability, availability, 
survivability, or security so as to contribute to improved 
system performance. In addition, since gathering infor¬ 
mation from spatially remote areas will consume more 
resources (e.g., time or energy) but improve decision 
making, one should investigate the tradeoff between 
resource consumption and decision making accuracy and 
timeliness. One may utilize aggregation technique to 
reduce resource consumption in obtaining information 
from distant nodes. 

• A trust management design must allow optimal settings 
to be identified under various network and environmental 
conditions so as to maximize the overall trust of the sys¬ 
tem for successful mission executions. Equally important 
is an understanding of sensitivity to deviations from the 
optimal settings. 

• There has been no comparison of trust management 
schemes versus conventional security schemes in terms 
of metrics of interest in MANETs. One example could 
be the comparison of trust management schemes to 
cryptographic schemes in detecting misbehaving nodes. 

• Local trust is easy to understand and compute, since it 
only involves tracking behaviors of neighboring nodes. 
Local trust is easy to defend from malicious attacks. 
Global trust is harder to compute and update; Eigentrust 
[123] is an example of a global trust metric. But a 
non-local definition of trust is subject to subversion and 
manipulation by colluding nodes. Zhang et al. [124] 
provide a robust version of the Eigentrust algorithm. A 
critical question is: is trust inherently local? How can a 
global trust metric be computed and distributed reliably? 

• Recently, social trust derived from social networks has 
received considerable attention for establishing trust in 
various applications. MANET designers may also want 
to take into account social trust. 

• The survey has focused on a trust value associated with 
individual nodes. But often we may be interested in 
associating trust with data or with a group of nodes or 
entities. Many of the concepts discussed here will extend 
naturally. 

VI. Concluding Remarks 

Trust is a multidimensional, complex, and context- 
dependent concept. Although trust-based decision making is 
in our everyday life, trust establishment and management 
in MANETs face challenges due to the severe resource 
constraints, the open nature of the wireless medium, the 
complex dependence between the communications, social and 
application networks, and, hence, the complex dependency 


of any trust metric on features, parameters, and interactions 
within and amongst these networks. 

In this paper, we surveyed and analyzed existing trust 
management schemes in MANETs to provide MANET trust 
network protocol designers with multiple perspectives on the 
concept of trust, an understanding of trust properties that 
should be observed in developing trust metrics for evaluating 
trust, and insights on how a trust metric can be customized 
to meet the requirements and goals of the targeted system. A 
composite trust metric that captures aspects of communica¬ 
tions and social networks, and corresponding trust measure¬ 
ment, trust distribution, and trust management schemes are 
interesting research directions. For dynamic networks, such 
as military MANETs, these schemes should have desirable 
attributes such as ability to adapt to environmental dynamics, 
scalability, reliability, and reconfigurability. 
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Appendix 

A SURVEY ON EXISTING TRUST MANAGEMENT SCHEMES 

IN MANETs 

See Tables I-VI. 
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TABLE I 


Authors, Year, 
Ref. no 

Purpose 

Methodology 

Attacks consid¬ 
ered 

Performance 

metrics 

Trust property 

Trust management model 

Marti (2000) 

[67] 

Secure routing 

Direct observation 
Reputation 

Black hole 

False accusation 

Throughput 
Overhead 
Detection accu¬ 
racy 

Dynamicity 

Watchdog and Pathrater 

Trust revocation introducing 
redemption 

Underlying routing protocol: 
DSR 

Buchegger & 

Boudec (2002) 
[68] 

Secure routing 

Direct observation 
Reputation 

Various 

malicious packet 
forwarding 

DoS 

N/A 

Weighted tran¬ 
sitivity 
Dynamicity 

An extension of DSR using 
a hybrid scheme of selective 
altruism and utilitarianism 

Buchegger 
& Boudec 

(2002) [69] 

Secure routing 
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Reputation 

Forward 

defection (e.g., 
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Dropped packets 
Overhead 

Utility* 3 

Weighted tran¬ 
sitivity 
Dynamicity 

CONFIDANT 

Bayesian Model 

Incentive mechanism 
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behaving nodes 

An extension of DSR 

Paul & Westhoff 
(2002) [70] 

Secure routing 

Direct observation 
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Packet modifica¬ 
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N/A 
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Contextaware Inference 

Mechanism 
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Michiardi & 

Molva (2002) 
[71] 
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Extension of [67] 
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He et al. 

(2004) [72] 
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Nekkanti & Lee 
(2004) [73] 

Secure routing 
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General outside 
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Average 
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Packet delivery 
ratio 

Trust value 

ranges from 
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integer 

Trust-based adaptive AODV 

Li et al. (2004) 
[74] 
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^“Weighted transitivity” means that trust is transitive only with a weight reflecting decaying of trust depending on confidence level, credibility, reliability, 
time, reputation, distance, etc. 


[11] S. Ruhomaa, L. Kutvonen, and E. Koutrouli, “Reputation Management 
Survey,” 2nd Inf l Conf. on Availability, Reliability, and Security, 10-13 
Apr. 2007, Vienna, Austria, pp. 103-111. 

[12] T. Grandison and M. Sloman, “A Survey of Trust in Internet Applica¬ 
tions,” IEEE Commun. Surveys and Tutorials, vol. 3, no. 4, pp. 2-16, 
2000. 

[13] J. H. Cho and A. Swami, “Towards Trust-based Cognitive Networks: 
A Survey of Trust Management for Mobile Ad Hoc Networks,” 14th 
Infl Command and Control Research and Technology Symposium, 
Washington D.C. 15-17 June 2009. 

[14] Merriam Webster’s Dictionary [Online]: 

http://www.merriamwebster.com/dictionary/trust%5B 1 %5D. 

[15] D. Gambetta, ’’Can We Trust Trust?” Trust: Making and Breaking 
Cooperative Relations, Basil Blackwell, Oxford, 1990, pp. 213-237. 

[16] N. Luhmann, Trust and Power, Wiley, 1979. 

[17] W. J. Adams, N. J. Davis, “Toward a Decentralized Trust-based Access 
Control System for Dynamic Collaboration,” Proc. 6th Annual IEEE 
SMC Information Assurance Workshop, 15-17 June, 2005, West Point, 
NY, pp. 317-324. 

[18] T. Harford, “The Economics of Trust,” [Online]: 
http://www.forbes.com/2006/09/22/trust-economy-markets- 
tech_cx_th_06trust_0925harford.html 

[19] H. S. James, “The Trust Paradox: A Survey of Economic Inquiries 
into the Nature of Trust and Trustworthiness,” Journal of Economic 
Behavior and Organization, vol. 47, no. 3, 2002. 

[20] A. B. MacKenzie and S. B. Wicker, “Game Theory and the Design of 


Self-Configuring, Adaptive Wireless Networks,” IEEE Commun. Mag., 
vol. 39, no. 11, pp. 126-131, Nov. 2001. 

[21] R. Axelrod, ’’The Evolution of Cooperation,” Science, vol. 211, no. 
4489, pp. 1390-1396, March 1981. 

[22] M. Srivatsa, S. Balfe, K. G. Paterson, and P. Rohatgi, “Trust Man¬ 
agement for Secure Information Flows,” Proc. 15th ACM Conf. on 
Computer and Communications Security, Alexandria, VA, Oct. 2008, 
pp. 175-188. 

[23] Stanford Encyclopedia of Philosophy, Feb. 20, 2006 [Online]: 
http ://plato. Stanford. edu/entries/trust/ 

[24] B. Lahno, “Olli Lagerspetz: Trust. The Tacit Demand,” Ethical Theory 
and Moral Practice, vol. 2, no. 4, pp. 433-435, Dec. 1999, Published 
Springer Netherlands. 

[25] Wikipedia-Trust in Social Science, 16 Sept. 2009 [Online]: 
http://en.wikipedia.org/wiki/Trust_(sociology) 

[26] M. Deutsch, The Resolution of Conflict: Constructive and Destructive 
Processes, Carl Hovland Memorial Lectures Series, New Haven and 
London: Yale University Press, 1973. 

[27] R. Hardin, “The Street-Level Epistemology of Trust,” Politics and 
Society, vol. 21, no. 4, 1993, pp. 505-529. 

[28] J. B. Rotter, “Interpersonal Trust, Trustworthiness, and Gullibility,” 
American Psychologist, vol. 35, no. 1, Jan. 1980, pp. 1-7. 

[29] D. McKnight and N. Chevany, “The Meanings of Trust,” Carlson 
School of Management, University of Minnesota, Technical Report 
TR 94-04, 1996. 

[30] F. D. Schoorman, R. C. Mayer, and J. H. Davis, “An Integrative 



















206 


TABLE II 


Authors, Year, 
Ref. no 

Purpose 

Methodology 

Attacks consid¬ 
ered 

Performance 

metrics 

Trust property 

Trust management model 

Pisinou et al. 
(2004) [75] 

Secure routing 

Direct observation 

Black hole 

Route injection 
Selfish nodes 

Route overhead 
Algorithm 
running time 
Number of 

routes selected 
Rout errors sent 

Transitivity 

Dynamicity 

An extension of AODV 
called Trust-embedded 

AODV 

Buchegger & 

Boudec (2004) 
[76] 

Secure routing 

Direct observation 
Reputation 

False information 
propagation 

Mean detection 
time for misbe¬ 
having nodes 

False alarm 

Dynamicity 

Bayesian Model 

Trust revocation introducing 
redemption 

Ghosh (2005) 
[77] 

Secure routing 
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Black hole 
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DoS 

Overhead 

Routes selected 
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Wang et al. 
(2005) [78] 

Secure routing 
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TABLE III 


Authors, Year, 
Ref. no 

Purpose 

Methodology 

Attacks consid¬ 
ered 

Performance 

metrics 

Trust property 

Trust management model 

Abusalah et al. 
(2006) 

[81] 

Secure routing 

Direct observation 
on forwarding 

packets 

Recommendation 

Evidences 

Packet dropping 

Routing 

overhead 

Route discovery 
time 

N/A 

TARP (trustaware routing 
protocol) 

Sen et al. (2006) 
[83] 

Secure routing 

Direct observation 
Evidences 

False accusation 
Packet dropping 
Tampering 

Packet dropping 
rate 

Reputation level 

Dynamicity 

Group trust value is used 
based on reputations for 
neighboring nodes 

Soltanali et al. 
(2007) 

[84] 

Secure routing 

Reputation only 
based on direct 
and indirect 

observation 

Selfish nodes 

Throughput 

Detection 
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Dynamicity 
Weighted tran¬ 
sitivity 
Subjectivity 

An extension of DSR with 
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reputation manager, opinion 
manager, routing and for¬ 
warding manager 

Balakrishnnan et 
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Secure routing 
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Recommendation 
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Routing 
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modification 
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recommendation 
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Weighted 

transitivity 
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Secure routing 
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Modified Bayesian model 
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Selective packet 
dropping by self¬ 
ish nodes 
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TABLE IV 


Authors, Year, 
Ref. no 

Purpose 

Methodology 

Attacks consid¬ 
ered 

Performance 

metrics 

Trust property 

Trust management model 

Adnane et al. 
(2009) [90] 

Secure routing 

Direct & indirect 
information 
Evidences (PKI) 

Fake identity 
General 

malicious nodes 

N/A 

N/A 

Extension of OLSR 

No specific trust metric 
shown 

Ayachi et al. 
(2009) [89] 

Secure routing 
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Message 
replication/ 
forgery /modificatioi 

N/A 

L 

N/A 

Extension of AODV 

No specific trust metric 
shown 

Weimerskirch et 
al. (2001) [91] 

Authentication 

Recommendation 

References 

Packet modifica¬ 
tion 

Breach of confi¬ 
dentiality 

DoS 

N/A 

Weighted tran¬ 
sitivity 

Distributed Trust Model 

Verma et al. 
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Authentication 
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Indirect 

information 

General inside 
and outside 

attackers 

N/A 
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An extension of DSR and 
ZRP 

Pirzada & 
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[47] 
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validation, and revocation 
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Ngai & Lyu 
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False certificate 
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TABLE V 


Authors, Year, 
Ref. no 

Purpose 

Methodology 

Attacks consid¬ 
ered 

Performance 

metrics 

Trust property 

Trust management model 

Adams & Davis 
(2005) [17] 

Access control 

Direct observation 
Reputation 

General 

misbehaving 

nodes 

N/A 

Conditional 

transitivity 1 

Trust is a 

continuous 

value 

Bayesian Model for risk as¬ 
sessment 

Yunfang (2007) 
[53] 

Access control 

Direct observation 

Reputation 

Policybased 

General 

misbehaving 

nodes 

N/A 

Weighted tran¬ 
sitivity 

Integrated (policy-based plus 
reputation-based) adaptive 
trust management 

Virendra et al. 
(2005) [99] 

Key 

Management 

Direct/indirect ob¬ 
servation 

False key 

generation 
Compromise of 
other nodes’ keys 

N/A 

Weighted tran¬ 
sitivity 
Subjectivity 
Dynamicity 

Trust is a con¬ 
tinuous value 

Physical-Logical Trust Do¬ 
mains (PLTDs) 

Hadjichristoh et 
al. (2005) [63] 

Key 

Management 

Direct observation 
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Stacking attack 
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malicious nodes 

% of available 
nodes with 

capability 
of service 

provision 

Average shortest 
path 

% of isolated 
nodes 

Success ratio to 
obtain system 
service 

Trust ranges 
from 1 to 100 
Dynamicity 

A modified hierarchical trust 
PKI model 

Adams et al. 
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[17] 
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We study a decision making model in a condition where it is equivalent to the two-dimensional Ising model, 
and we show that at the onset of phase transition it generates temporal complexity, namely, nonstationary and 
nonergodic fluctuations. We argue that this is a general property of criticality, thereby opening the door to 
the application of the recently discovered phenomenon of complexity matching: For an efficient transfer of 
information to occur, a perturbing complex network must share the same temporal complexity as the perturbed 
complex network. 
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I. INTRODUCTION 

Phase transitions and critical phenomena occur frequently 
in nature and have been widely studied by physicists, see 
for instance [1]. The Ising model [2] originally introduced to 
explain ferromagnetic phase transition is well known, and the 
exact solution found by Onsager [3] for the occurrence of phase 
transition in the two-dimensional case is widely recognized as 
an example of outstanding theoretical achievement. In the last 
few years some scientists have used the Ising model to shed 
light on biological and neurophysiological processes [4-7]. 
More precisely, the authors of [4] used the Ising model 
to explain the collective behavior of biological networks 
and the authors of [5-7] adopted the Ising model for the 
purpose of supporting their hypothesis that the brain works 
at criticality without establishing a clear distinction between 
phase transition and self-organized criticality [8]. Finally, 
we have to mention that the Ising model is frequently used, see 
for instance [9,10], to model neurophysiological data with the 
constraint of maximal entropy. Although relevant to explore 
the cognition properties of the brain, this is a perspective 
different from the one adopted in this paper. 

Couzin [11] made the conjecture that the cooperative 
interaction between the birds of a flock may mimic the 
brain cognition properties, and the experimental observation 
of Cavagna et al. [12] confirmed the criticality condition of 
a coordinated flock of birds. It is important to emphasize that 
the connection between phase transition processes and coop¬ 
erative behavior of biological systems was already stressed in 
the pioneering paper of Vicsek et al. [13]. It is thought [4] that 
biological systems in general operate at criticality. 

The phase transition condition is not confined to biological 
systems. The recent work on econophysics [14,15] suggests 
that the same arguments can be applied to sociological 
systems. As a matter of fact, Sornette [15] proposes a model 
of cooperative economical interaction, of the same type as 
the Ising model, which generates at criticality a transition 
from subcritical to supercritical patterns very similar to those 
produced by the Ising model. As stressed by the authors 
of [5], the Ising patterns at criticality correspond to the 
emergence of correlation links yielding a scale-free network 
statistically indistinguishable from that experimentally ob¬ 
served within the brain, using functional magnetic resonance 
imaging. 


These studies emphasize the spatial and network com¬ 
plexity emerging from the cooperative interaction of the 
network’s units, but overlook the temporal complexity of 
these networks. Herein we are attempting to fill this gap 
and prove that the temporal complexity emerges at criticality. 
Temporal complexity is defined as follows. The time dynamics 
of complex networks is characterized by the occurrence of 
significant events, which may be financial crashes [15], brain 
quakes [5,6], or the changes of direction of a flock of birds 
[12,13]. The time interval between two consecutive events 
is given by a distribution density fr(t), which, drastically 
departing from the conventional Poisson statistics, has the 
inverse-power-law form 

1 

^(r)oc—, (1) 

r /x 

with fi < 2. The occurrence of an event does not have any 
memory of the occurrence of earlier events. This property is 
usually denoted as renewal, but it must not be confused with the 
ordinary Poisson and Markov condition: The signal generated 
by these events is characterized by long-range correlation in 
time, and, most importantly, it is essentially nonstationary, 
thereby breaking the ergodicity that is a fundamental property 
of statistical physics [16]. We refer to it as the non-Poisson 
renewal condition. A significant event is interpreted as a failure 
whose occurrence brings the network back to a brand new 
condition. We refer to these as crucial events. 

Events of this kind may be very difficult to detect. The 
theoretical discussion [16], for instance, applies to the case of 
dichotomous signals, where crucial events correspond to the 
abrupt transition from one to the other value of the signal. In 
the case of the decision making model (DMM) under study, the 
single units in isolation produce perfect dichotomous signals. 
They can be easily detected, but they are of a Poisson kind, 
thereby strikingly departing from the condition of temporal 
complexity. When the control parameter is extremely large, the 
global signal undergoes abrupt transitions that however depart 
again from the condition of temporal complexity, because 
they are predominantly of a Poisson kind. At criticality the 
global signal loses these abrupt transitions that are replaced 
by smoother fluctuations around the mean value. Herein we 
consider a model generating a signal whose time mean value 
vanishes. We assume the crossing of the origin to be the 
proxy of a significant event. Thus, we interpret the signal as a 
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dichotomous fluctuation making abrupt changes from 1 to — 1 
and back again. We evaluate the correlation function of this 
ideal dichotomous signal, denoted by the symbol t,t '), and 
we prove that its dependence on the times t and t' exactly fits 
the prediction of the non-Poisson renewal condition. 

The connection between phase transition, and its spatial 
and network complexity, with temporal complexity is not well 
known. In the literature there are only a few papers where this 
connection is discussed. One is [17], which converts the 3D 
Ising model into a critical map generating type I intermittent 
dynamics and consequently the same temporal complexity 
as the one under discussion herein. The emergence of non- 
Poisson renewal properties was also discussed in [18,19], 
which is devoted to illustrating the cooperative properties of 
a DMM. The foundation of these papers is sociological and 
similar to that of the economical model illustrated in the work 
of Sornette [15]. Thus, the authors [18,19] did not discuss the 
connection of the DMM with the Ising model and adopted the 
nonrealistic but often made assumption of all-to-all coupling 
among the network’s units, thereby confining the emergence 
of an inverse power law, namely, of temporal complexity, to 
the same time scale as that of the units in isolation. 

The motivation for the present research work is given by 
the transfer of information from a complex network to another 
complex network [20]. Rather than using entropic arguments, 
as is usually done with the transfer of information, we open 
the door to the concept of complexity management recently 
advocated in [21]. This latter work established that a complex 
network driven by crucial events is sensitive to stimuli with the 
same complexity. In other words, the transfer of information 
rests on temporal complexity and this explains the motivation 
to prove that criticality also generates temporal complexity. 

In Sec. II we discuss under which conditions the DMM [18, 
19] is equivalent to the Ising model. In Sec. Ill we prove that 
at criticality the function ty(t) becomes an inverse power law 
for over four decades. We shall also argue that with increasing 
the network’s size may become an infinitely extended 
inverse power law. Section IV shows that this is a renewal 
process. Finally we devote Sec. V to concluding remarks. 


II. DETAILED DESCRIPTION OF THE MODEL 


We consider a system of L discrete variables located at the 
nodes of a two-dimensional square lattice. Each unit s t is a 
stochastic oscillator and can be found in either of two states, 
+ 1 or —1. The dynamic is introduced by choosing a single 
unit on site i and updating it in an elementary time step with a 
transition rate g: 


g(s+ l -* s { ') = go exp 
g( s i~ l ■y 1 ) = go exp 



M-i) 

- M-i) 


( 2 ) 

( 3 ) 


Here M denotes the total number of nearest neighbors, and 
M + 1 and M_i the number of nearest neighbors being in the 
state +1 and —1, respectively. Single units change its states, 
thereby making M + \ and M _\ fluctuate in time, while, of 
course, the total number of nearest neighbors is conserved, 
A/_|_i M—i — M. 
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FIG. 1. (a) Temporal evolution of a single unit s t {t) and (b) of the 
global order parameter £ (t) for the decision making model realized on 
a square lattice of L = 50 x 50 nodes, with g 0 = 0-01 and K = 1.70. 
To illustrate the concept of crucial events we mark the time intervals 
r between two consecutive events, according to their definitions 
assumed in this paper. Notice different time scales on both plots. 


All numerical calculation are performed on a square lattice 
of either L = 50 x 50 or L = 100 x 100 nodes with periodic 
boundary conditions. In a single time step a run over the whole 
lattice is performed and for every unit si the transition rate of 
Eqs. (2) or (3) is calculated according to which a node is given 
the possibility to change its state. The single unit in isolation, 
K = 0, fluctuates between states +1 and — 1 with the transition 
rate g = g 0 . When coupling constant K > 0, a unit in the state 
+ 1 (—1) makes a transition to the state —1 (+1) faster or 
slower according to whether M_i > M + \ (M + \ > M_i) or 
Af_i < M +1 (M +1 < M_i), respectively. 

Next, we define the global order parameter §(7) = 

I s i (which is characterized by the variability that 
does not possess the dichotomous character of single units. In 
Fig. 1 we show exemplary temporal evolution for the single 
unit Sj(t) and for the global order parameter §(7). Note that 
the amplitude of variable §(7) depends on the value of the 
coupling constant K. When K — 0, all units in the system are 
independent Poisson processes; thereby an average taken at 
any moment of time over all of them is zero. Once the value 
of the coupling increases, K > 0, single units are less and 
less independent, resulting in a nonzero average. The quantity 
K c is the critical value of the control parameter K , at which 
point a phase transition to a global majority state occurs. In 
numerical calculations we use the time average % eq = (|§(f)|) 
as a measure of this global majority. More precisely, after initial 
10 6 time steps, the average is taken over the same number of 
the consecutive time steps of the model. 

Note that in the special case when M is the same for all the 
nodes and go ^ 1, we find that our model generates the same 
phase transition as the two-dimensional Ising model discussed 
in the seminal paper of Onsager [3]. This is an expected result 
insofar as the Ising model rests on the Hamiltonian 

L 

H = -jJ2wj’ ( 4 ) 
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where cr* and Oj denote the Pauli operators with eigenvalues 
zb 1. According to Boltzmann statistics equilibrium is described 
by the density matrix 

exp (-pH) 

P = ---, (5) 

with /-) = j-y, kg being the Boltzmann constant and T the 
absolute temperature. Thus, the off-diagonal elements of the 
transfer matrix [22] become equivalent to the transition rates 
of Eqs. (2) and (3), under the condition 


k B T = 


J 

K 


M 

~K' 



As examples of conditions yielding this equivalence, we 
consider two cases. The former is the case of all-to-all 
coupling, where M = L. The latter is considered here, and 
it is the two-dimensional lattice where each node is coupled to 
its 4 nearest neighbors, thereby setting M = 4. 

The thermodynamical condition M = L = oo was dis¬ 
cussed extensively by authors of [18,19], who showed that 
under those conditions the ratios are equivalent to the 
probabilities p±\ for a node si to be in one of two allowed 
states. The dynamic evolution of a single unit state is then 
described by a two-state master equation 


j t P+\ = s i l )P+i + g( s i 1 s+ l )p_ u (7) 

f f P -1 = -g^f 1 s+ l )p-i+g(s+ [ -> sr')p +l . (8) 


Solving the above for the difference in probabilities n(£) = 
p+i(t) — p-\(t), which corresponds to the earlier defined 
global order parameter §( t ), one obtains 


d 

— n = 2go[sinh(/ni) — n cosh(/ni)]. (9) 

dt 


This equation yields two solutions, corresponding to global 
majority states, for the values of coupling constant K > K c , 
where Kc = 1. 

The solution to the latter condition of M = 4, L = oo can be 
found in [23] and yields the condition for the global variable to 
b et; eq = (1 — [sinh( J /T/2)] -4 ) 1 / 8 . In this case the critical value 

of the coupling constant is K c = 21n(l + *J2) = 1.7627. In 
Fig. 2, we show the phase transition in the latter condition and 
we find that, as expected, the numerical evaluation of t= eq ( K ) is 
very close to the theoretical prediction of Onsager [3], thereby 
confirming that the DMM is equivalent to the Ising model in 
the limiting case go —> 0. 

This equivalence between the DMM and the Ising model is 
merely formal, because the DMM does not have a Hamiltonian 
origin and does not require the action of a thermal bath at 
temperature T to work as does the Boltzmann picture. This 
explains why the equivalence with the Ising model requires 
that go vanish, so as to freeze the dynamics of the single units, 
in the absence of cooperation. 

When we release the condition go —> 0, the equivalence of 
the DMM with the Ising model is lost. In Fig. 3, we see that, 
if the condition of go 1 is abolished, the phase transition 
emerges at values of K lower than Kc of the theoretical 
prediction. Simultaneously, further increase of go (go ~ 0.40) 
leads to a new regime, in which every unit is surrounded by 



FIG. 2. The phase diagram for global variable % eq . Thin solid line 
and dashed line are the theoretical predictions for the fully connected 
and two-dimensional regular network, respectively. In both cases 
L = oo and the latter case is the Onsager theoretical prediction [3] 
for 2D regular lattice. The thick solid line corresponds to the global 
states observed for two-dimensional regular lattice (L = 100 x 100 
nodes) and go = 0.01. Periodic boundary conditions were applied. 

nearest neighbors in the opposite state, yielding an update of 
its state at every time step and generating the condition in 
which the order parameter %(t) is exactly zero at all times. It 
is worth pointing out that when a network characterized by 
a large transition rate g (transition rate g ~ 1) is studied, it 
shows sensitivity to the initial random configuration of the 
lattice. We observed that in a limited number of cases the 
order parameter evolves to a global majority state, in which 
all the nodes are in the same state rather than in condition 
t=(t) = % eq =0. Therefore one may consider a bifurcation in 
the phase space of the model parameters in this regime. 


III. ORDER PARAMETER REVERSAL TIMES 

Fet us now study the temporal complexity of the order 
parameter %(t). As pointed out in Sec. I, to prove temporal 
complexity of §( t ) we have to observe significant events 
and prove that they are crucial. At criticality the signal is 
not dichotomous and, as we shall see hereby, there are good 
reasons why it must depart from the dichotomous condition. 



FIG. 3. The two-dimensional phase diagram of global variable 
% eq evaluated for a range of model parameters, go and K, on a two- 
dimensional regular lattice of size L = 100 x 100 nodes. 
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FIG. 4. Survival probability function ^(r) for the order parame¬ 
ter f (t) evaluated on a two-dimensional lattice of size L = 50 x 50 
for go = 0.01 and increasing values of coupling constant K. The 
straight line corresponds to the slope of —0.50, namely to /x = 1.50, 
since 'I'(t) ~ 7 ^ T . 

We make the conjecture that the crossing of the origin, namely 
the times at which %{t) changes sign, are the significant events 
to observe. As illustrated on Fig. 1, we interpret the time 
interval r between two consecutive crossings as the time 
duration of a given decision, even if this decision may rest 
on a slight and fluctuating majority. 

We evaluate the distribution density of decision-time dura¬ 
tions r, VK r ), and the corresponding survival probability T'(r), 
where ^(t) = f °° dr\lr(x). Although emerging from a simple 
regular lattice, that is, one with no structural complexity, 
the survival probability presented in Fig. 4 shows a scale-free 
property that extends over more than four decades in time 
for K & Kc . A further increase of K does not affect the 
power-law region and has the effect of producing a more and 
more extended exponential shoulder. The exponential shoulder 
is expected to become predominant for K —> oo. 

As mentioned earlier, a single unit in isolation fluctuates 
between two states with the transition rate g = g 0 . The 
corresponding survival probability function is an exponential 
function T'(r) = exp(— got). At the same time, as illustrated 
on Fig. 5, a coupled unit tends to update its state with a 
transition rate smaller than go. This is a property of criticality 
that is lost completely for very large values of the control 
parameter K. Although it is a computational challenge to 
explore the dynamics corresponding to extremely large values 
of K , our numerical results suggest that when K is very close to 
the critical value, and a decision is reached by a slight majority, 
the single units have a dynamics almost indistinguishable 
from the Poisson dynamic that they would have in isolation. 
As we increase the control parameter K and the majority 
becomes larger the single units keep their Poisson dynamics 
with a smaller rate. Although it is not possible because of 
computer-time limitations to study the network’s dynamics 
for values of K much larger than K = 1.80, it is plausible to 
make the conjecture that the single units maintain their Poisson 
dynamics and that these dynamics become closer and closer to 
that of the global variable that will lose its power-law structure 
and will become predominantly exponential. In other words, 
unanimous and permanent consensus must be perceived as the 



FIG. 5. The survival probability function ^(r) of the global order 
parameter £ (t) is compared with the transitions between two states 
for a single unit s,. Simulations were performed on a lattice of 
size L = 50 x 50 for g 0 = 0.01 and increasing values of coupling 
constant K. 

manifestation of a limiting condition of Poisson but infinitely 
slow dynamics, where the behavior of the single units becomes 
identical to that of the global signal. 

To make this argument more compelling let us discuss the 
properties of the observed exponential shoulder. Since one 
observes the network in an organized phase (.K > Kc) it is 
not surprising that one perceives the exponential signature of 
an equilibrium regime. Therefore we assume that under those 
conditions the global variable §(f) follows the dynamics of a 
particle diffusing in a double potential well. In the equilibrium 
condition one would expect the survival time of staying in 
one well to be T'(r) = exp(— ax), where a = Aexp(— Q/D). 
The parameter Q denotes the height of the potential barrier 
separating the wells, and D stands for the diffusion coefficient. 
Following [19] we expect the barrier to be a function of 
the coupling constant K and the diffusion to depend on the 
number of nodes L. By fitting the shoulder, once for a case 
where L is kept constant and K varies, and secondly for an 
opposite condition, we assess the above hypothesis. The fitting 
procedure revealed that the barrier height Q is a linear function 
of K ( Q ~ K) and that the diffusion coefficient D is inversely 
proportional to the system size L (D ~ 1/L). For L —> oo the 
transition rate becomes infinitely small and the exponential 
shoulder becomes predominant and virtually coincident with 
the Poisson dynamics of the single units. 

While in the organized phase the effect of increasing L 
is that of making the process exponential, although with a 
virtually infinite transition time. At criticality the increase of 
L has a dramatically different effect. The authors of [19], 
based on this DMM in the all-to-all condition, found that at 
criticality an inverse-power-law behavior emerges, which is 
confined, however, to a time region with the same size as the 
Poisson time x P & 1/go- However, this is not the case when 
the units are the nodes of a regular two-dimensional lattice, 
interacting only with the four nearest-neighbor nodes. We 
have evaluated the time size of the inverse-power-law region 
appearing before the exponential shoulder and we found that 
its size tends to increase with increasing L as x P ~ L~ 125 . 
Unfortunately, this observation is limited to values of L 
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smaller than L = 10000, since going beyond would exceed 
the limits of our computer facilities. However, on the basis of 
this observation we reach the conclusion that for L -> oo the 
size of the temporal complexity must become infinitely large. 
Thus, criticality is a kind of complex singularity embedded in a 
Poisson sea. 

These intuitive arguments also explain why the signal %(t) 
at criticality must depart from the dichotomous condition. 
This is a consequence of the fact that the single units are 
not rigidly bound to follow the opinion of the majority. 
There is, consequently, a subtle connection between criticality, 
free will, and correlation between different units. Although 
the single units may change opinion, they cannot do that in a 
way totally uncorrelated from the behavior of the other units, 
insofar as this would be incompatible with the emergence of a 
majority, as slight as it might be. 

IV. TESTING RENEWAL PROPERTIES 

As pointed out earlier, the signal %{t) at criticality is not 
dichotomous. However, replacing it with a dichotomous signal 
corresponding to 1 (yes) when %(t) > 0 and to —1 (no) when 
%(t) < 0, thereby turning it into an ideal dichotomous signal, 
is the most convenient way to reveal the emergence of crucial 
events at criticality. The theory [24] (see also the earlier 
work [25]) allows us to evaluate the nonstationary correlation 
function 4*( t,t') in the specific case when the renewal process 
rests on the waiting time distribution density assumed to 
be an ideal inverse power law. In general, regardless of whether 
this ideal condition of infinitely extended inverse power law is 
realized or not, it is possible to establish whether the process 
is renewal by noticing that 

= *(T,f fl ), (10) 

where r = t — t' and t a = t'. As explained [26], the function 
*P(t ,t a ) can be evaluated by using the time series {£/}, where U 
are the times at which the fluctuation §( t ) crosses the origin. We 
use a mobile window of length t a to evaluate the waiting time 
distribution \lr(r,t a ) and the corresponding survival probability 
*P(t ,t a ), by locating the origin of the window on the time 
of occurrence of an event and measuring the time distance 
between the end of the window and the time of occurrence of 
the first event after the window end. To establish whether 
the process is renewal or not we shuffle the sequence of 
reversal times, thereby generating a new time series {t-}. 
We use the same procedure based on a mobile window to 
determine ^(G )lS , t), which corresponds to the nonstationary 
correlation function in the renewal case. If the two survival 
probabilities coincide, we conclude that the process is renewal. 
If the renewal process is exponential, the correlation function 
is stationary, and there is no aging. 

In the ideal case of an infinitely extended inverse power law, 
with power index /jl < 2, as discussed in earlier work [16,24], 
aging is perennial. In the case studied in this paper, we have 
to take into account that this ideal condition would be realized 
by assigning an infinitely large value to L, with the ensuing 
consequence that temporal complexity would become virtually 
invisible, due to the joint effect of an extremely slight majority 
and of a very extended sojourn in a given decision state. To 
make temporal complexity visible, we need a compromise, 



FIG. 6. Testing for renewal property. Survival probability func¬ 
tion 4flr) of the order parameter £(f) evaluated on lattice of size 
L = 50 x 50 and go — 0.01, K = 1.70 was aged respectively by 
time t a = 100, t a = 1000, and t a — 10000. Those aged survival 
probabilities 4fl4,r) are compared with the aged renewal prediction 
V(t atS , r). 

and we have to set a limit on the time extension of the 
inverse power law. As a consequence, we obtain the very 
interesting result depicted in Fig. 6, where aging, namely the 
slowing down of the survival probability, becomes very large 
in the inverse-power-law region, with the survival probability 
remaining virtually constant, and in the correspondence of 
the exponential shoulder it is virtually suppressed, in accor¬ 
dance with the principle that Poisson statistics annihilates 
aging. 

V. CONCLUSIONS 

This paper establishes that at the onset of phase transition, 
in addition to the spatial and network complexity shown by a 
number of earlier investigations [4-7,9,12,14,15], we also have 
temporal complexity. This significant result reveals a path for 
the transport of information from one complex network to 
another. As pointed out in Sec. I, there is a general agreement 
that complex systems are a set of many units interacting at 
the onset of phase transition. The present analysis proves that 
these complex networks are characterized also by temporal 
complexity, and consequently a perturbing complex network 
is expected to exert its influence on another complex network 
via the recently discovered complexity management process 
[21,31]. Although the present theoretical predictions are based 
on the assumption that ^(r) is an ideal inverse power law and 
the real complex networks, as shown herein, reach this ideal 
condition only when their size is infinitely large, the result 
in Fig. 5 indicates that the time region generating ergodicity 
breakdown may become so extended as to make a complex 
network virtually insensitive to stimuli that do not share the 
same extended nonergodic condition. 

In the last few years the attention of investigators has 
been moving from chaos synchronization of two nonlinear 
oscillators [27] to the synchronization dynamics of many units 
in large-scale networks [28], called inner synchronization [29] 
for convenience. The cooperation induced phase transition 
discussed herein can to some extent be thought of as a form of 







219 


inner synchronization. As pointed out [29] the challenge that 
the researchers in the field of complexity have now to address 
is the phenomenon of outer synchronization [30]. 

In this light, we conclude that the results presented here 
provide a way of addressing outer synchronization, redirecting 
the attention of researchers from the details of the topology 
of the complex networks to the analysis of the temporal 
complexity that inner synchronization may generate. In fact the 


principle of complex management [21,31] rests on temporal 
complexity, regardless of the way it is generated. 

ACKNOWLEDGMENTS 

M.G. and P.G. acknowledge financial support from ARO 
and Welch through Grants No. W911NF-05-1-0205 and No. 
B-1577, respectively. 


[1] H. E. Stanley, Introduction to Phase Transition and Critical 
Phenomena (Oxford University Press, New York, 1987). 

[2] E. Ising, Z. Phys. 31, 253 (1925). 

[3] L. Onsager, Phys. Rev. 65, 117 (1944). 

[4] T. Mora and W. Bialek, e-print arXiv:1012.2242vl. 

[5] D. Fraiman, P. Balenzuela, J. Foss, and D. Chialvo, Phys. Rev. 
E 79, 061922 (2009). 

[6] D. R. Chialvo, Nature Phys. 6, 744 (2010). 

[7] M. G. Kitzbichler, M. L. Smith, S. R. Christensen, and 

E. Bullmore, PLoS Comput. Biol. 5, 1 (2009). 

[8] P. Bak and M. Paczuski, Proc. Natl. Acad. Sci. USA 92, 6689 
(1995). 

[9] W. Truccolo, L. R. Hochberg, and J. P. Donoghue, Nature 
Neurosci. 13, 105 (2010). 

[10] E. Schneidman, M. J. Berry II, R. Segev, and W. Bialek, Nature 
(London) 440, 1007 (2006). 

[11] I. Couzin, Nature (London) 445, 715 (2007). 

[12] A. Cavagna, A. Cimarelli, I. Giardina, G. Parisi, R. Santagati, 

F. Stefanini, and M. Yiale, Proc. Natl. Acad. Sci. USA 107, 
11865 (2010). 

[13] T. Vicsek, A. Czirok, E. Ben-Jacob, I. Cohen, and O. Shochet, 
Phys. Rev. Lett. 75, 1226 (1995). 

[14] H. E. Stanley and Y. Plerou, Quant. Finance 1, 563 (2001). 

[15] D. Sornette, Phys. Rep. 378, 1 (2003). 

[16] A. Rebenshtok and E. Barkai, J. Stat. Phys. 133, 565 (2008). 

[17] Y. F. Contoyiannis and F. K. Diakonos, Phys. Lett. A 268, 286 
(2000); Y. F. Contoyiannis, F. K. Diakonos, and A. Malakis, 
Phys. Rev. Lett. 89, 035701 (2002). 

[18] S. Bianco, E. Geneston, P. Grigolini, and M. Ignaccolo, Physica 
A 387, 1387 (2008). 


[19] M. Turalska, M. Lukovic, B. J. West, and P. Grigolini, Phys. 
Rev. E 80, 021110(2009). 

[20] G. Tkacik, A. M. Walczak, and W. Bialek, Phys. Rev. E 80, 
031920 (2009). 

[21] G. Aquino, M. Bologna, P. Grigolini, and B. J. West, Phys. Rev. 
Lett. 105, 040601 (2010). 

[22] M. Plischke and B. Bergersen, Equilibrium Statistical Physics 
(World Scientific Publishing Company, 2006). 

[23] B. M. McCoy and T. T. Wu, The Two-Dimensional Ising Model 
(Harvard Univeristy Press, Cambridge, 1973). 

[24] P. Allegrini, G. Aquino, P. Grigolini, L. Palatella, A. Rosa, and 
B. J. West, Phys. Rev. E 71, 066109 (2005). 

[25] C. Godreche and J. M. Luck, J. Stat. Phys. 104, 489 (2001). 

[26] S. Bianco, P. Grigolini, and P. Paradisi, J. Chem. Phys. 123, 
174704 (2005). 

[27] L. M. Pecora and T. L. Carroll, Phys. Rev. Lett. 64, 821 
(1990). 

[28] S. Boccaletti, V. Latora, Y. Morento, M. Chavez, and D. U. 
Hwang, Phys. Rep. 424, 175 (2006); J. Zhou, L. Xiang, and 
Z. Liu, Phys. A 385, 729 (2007); A. Arenas, A. Diaz-Guilera, 
J. Kurths, Y. Moreno, and C. Zhou, Phys. Rep. 469, 93 (2008); 
W. Yu, G. Chen, and J. Lu, Automatica 45, 429 (2009); M. Chen, 
Y. Shang, C. Zhou, Y. Wu, and J. Kurths, Chaos 19, 013105 
(2009). 

[29] X.-J. Wu and H.-T. Lu, Chinese. Phys. B 19, 070511 (2010). 

[30] E. Montbrio, J. Kurths, and B. Blasius, Phys. Rev. E 70, 056125 
(2004); C. P. Li, W. G. Sun, and J. Kurths, ibid. 76, 046204 
(2007). 

[31] G. Aquino, M. Bologna, B. J. West, and P. Grigolini (unpub¬ 
lished). 




221 


Noncoherent Physical-Layer 
Network Coding with FSK Modulation: 
Relay Receiver Design Issues 

Matthew C. Valenti, Don Torrieri, and Terry Ferrett 
IEEE Transactions on Communications, 59(9), 2595-2604 (2011) 


Resea rc\\@MRL 



222 


Noncoherent Physical-Layer Network Coding with 
FSK Modulation: Relay Receiver Design Issues 

Matthew C. Valenti, Senior Member, IEEE, Don Torrieri, Senior Member, IEEE, 

and Terry Ferrett, Student Member, IEEE 


Abstract —A channel-coded physical-layer network coding 
strategy is refined for practical operation. The system uses 
frequency-shift keying (FSK) modulation and operates nonco- 
herently, providing advantages over coherent operation: there 
are no requirements for perfect power control, phase synchro¬ 
nism, or estimates of carrier-phase offset. In contrast with 
analog network coding, which relays received analog signals plus 
noise, the system relays digital network codewords, obtained by 
digital demodulation and channel decoding at the relay. The 
emphasis of this paper is on the relay receiver formulation. 
Closed-form expressions are derived that provide bitwise log- 
likelihood ratios, which may be passed through a standard 
error-correction decoder. The role of fading-amplitude estimates 
is investigated, and an effective fading-amplitude estimator is 
developed. Simulation results are presented for a Rayleigh block¬ 
fading channel, and the influence of block length is explored. 
An example realization of the proposed system demonstrates a 
32.4% throughput improvement compared to a similar system 
that performs network coding at the link layer. By properly 
selecting the rates of the channel codes, this benefit may be 
achieved without requiring an increase in transmit power. 

Index Terms —Network coding, two-way relay channel, 
frequency-shift keying, noncoherent reception, channel estima¬ 
tion. 

I. Introduction 

N the two-way relay channel (TWRC), a pair of source 
terminals exchange information through an intermediate 
relay without a direct link between the sources [1]. The 
exchange can occur in two, three, or four orthogonal time 
slots, depending on how the information is encoded [2]. With 
a traditional transmission scheduling scheme , the exchange 
requires four slots. In each of the first two slots, one of the 
terminals transmits a packet to the relay, while in each of 
the last two slots, the relay transmits a packet to each of the 
terminals. By using network coding [3], the number of slots 
can be reduced. With link-layer network coding (LNC), the 
third and fourth slots are combined into one slot by having 
the relay add (modulo-2) the packets that it receives from the 
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Fig. 1. (a) Link-layer network coding, and (b) Physical-layer network coding. 

two terminals. During the third step, the relay sends the sum 
of the two packets, and each terminal is able to recover the 
information from the other terminal by subtracting (or adding, 
modulo-2) its own packet from the received signal. With 
physical-layer network coding (PNC), the first two slots are 
combined by having the two terminals transmit their packets at 
the same time [2]. The relay receives a combination of both 
modulated packets during the first slot, which it broadcasts 
(after appropriate processing) to the two terminals during the 
second slot. PNC-based strategies capable of supporting more 
than just two source terminals over the TWRC may be found 
in [4]. 

The transmission schedules for LNC and PNC are illustrated 
in Fig. 1. The source terminals A/i and A4 transmit messages 
ui and 112 , respectively, where each message is a packet 
containing many information bits. The messages are (channel) 
encoded and modulated by the function Ts(-). In the case of 
LNC, the two messages are sent in orthogonal time slots, while 
in the case of PNC, they are sent to the relay at the same time 
over a multiple-access channel (MAC). For both LNC and 
PNC, the relay broadcasts the encoded and modulated signal 
Tr(u) in the final time slot, where u is the network codeword 
and Tr(-) is the function used by the relay to encode and 
modulate the network codeword. Using the received version 
of Tr(u) and knowledge of its own message, each terminal 
is able to estimate the message sent by the other terminal. 

There are several options for implementing PNC. The relay 
may simply amplify and forward the signal received from the 
end nodes, without performing demodulation and decoding. 
This PNC scheme is referred to as analog network coding 
(ANC) in [5] and PNC over an infinite field (PNCI) in [6]. 
Another option is for the relay to perform demodulation and 
decoding in an effort to estimate the network codeword, which 
is remodulated and broadcast to the terminals. This scheme 
is simply called PNC in [2] and PNC over a finite field 
(PNCF) in [6], but in this paper we refer to it as digital 
network coding (DNC) to distinguish it from ANC. Under 
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many channel conditions, DNC offers enhanced performance 
over ANC. This is because the decoding operation at the relay 
helps DNC to remove noise from the MAC phase, while the 
noise is amplified by the relay when ANC is used. However, 
ANC avoids the computational complexity of demodulation 
and decoding at the relay. 

Symbol timing is a critical consideration in systems em¬ 
ploying PNC. Synchronization of the clocks and packet trans¬ 
missions at the two source nodes can be achieved by network 
timing updates. These updates are routine in networks with 
scheduling mechanisms, such as cellular networks. When the 
propagation times of the signals from the sources differ, the 
symbols arrive at the relay misaligned. The timing offset is 
r = A d/c, where c is the speed of light, and A^ is the 
difference in link distances from the sources to the relay. For 
insignificant delay, we need r << T s j 2, where T s denotes 
the symbol period. This constraint limits the symbol rate. As 
an example, assume A^ = 300 meters. Then, T s >> 2 /is is 
required, and the symbol rate is limited to 250 kilosymbols/s. 
An alternative is to delay the transmission of the node closer 
to the relay by r. However, this requires tracking the distances 
between the sources and the relay. 

A common assumption made in the PNC literature is 
that the signals are coherently demodulated and that perfect 
channel-state information (CSI) is available at the receivers. 
For instance, decode-and-forward relaying has been consid¬ 
ered for binary phase-shift keying [7] and minimum-shift 
keying [8] modulations, but in both cases the relay must 
perform coherent reception. An amplify-and-forward protocol 
is considered in [9], which allows the decision to be deferred 
by the relay to the end-node, though detection is still coherent. 
When two signals arrive concurrently at a common receiver, 
neither coherent detection nor the cophasing of the two 
signals (so that they arrive with a constant phase offset) is 
practical. The latter would require preambles that detract from 
the overall throughput, stable phases, and small frequency 
mismatches. To solve this problem, frequency-shift keying 
(FSK) was proposed for DNC systems in [10] and [11]. 
A key benefit of using FSK modulation is that it permits 
noncoherent reception, which eliminates the need for phase 
synchronization. An alternative to noncoherent FSK is to use 
differential modulation, which has been explored in [12]. 

In PNC systems, it is desirable to protect the data with 
a channel code. The combination of channel coding and 
physical-layer network coding is considered in [13] In [11], 
we investigate the use of a binary turbo code in a noncoherent 
DNC system. When using a binary turbo code in a DNC 
system, the relay demodulator must be able to produce bitwise 
log-likelihood ratios (LLRs) that are introduced to the input 
of the channel decoder. 

Channel estimation is an important issue, especially when 
a channel code is used. A training-based channel estimation 
scheme for PNC at the relay assuming amplify-and-forward 
operation is considered in [14]. The relay estimates channel 
parameters from training symbols and adapts its broadcast 
power in order to maximize the signal-to-noise ratio at the end 
nodes. Estimation of both channel gains in the two-way relay 
channel at the end nodes, rather than the relay, is considered 
in [15]. Novel channel estimators are presented which provide 


better performance than common techniques such as least- 
square and linear-minimum-mean-squared error estimation. In 
[16], we propose a blind channel estimator for the relay of the 
noncoherent DNC system. 

In this paper, we investigate receiver-design issues related 
to the use of noncoherent FSK in DNC systems. While 
noncoherent FSK has been previously proposed for DNC 
syterns in [10], we make the following specific contributions: 

1) We provide closed-form expressions for the relay re¬ 
ceiver decision rule with different types of CSI. This 
is in contrast with [10], which resorted to numerical 
methods to solve the decision rule (see the comment 
below equation (8) in [10]). 

2) We consider the use of a turbo code for additional 
data protection. This requires that the relay receiver be 
formulated so that it produces bitwise LLRs, which may 
be passed through a standard turbo decoder. 

3) We provide results for Rayleigh block-fading channels. 
The results in [10] were only for a phase-fading channel. 

4) We propose a channel estimator that is capable of 
determining the fading amplitudes of the channels from 
the two terminals to the relay. The estimator does not 
require pilot symbols. 

The remainder of this paper is organized as follows. 
Section II presents the system model used throughout the 
paper. Section III derives the relay receiver, while Section 
IV discusses channel-estimation issues. Section V provides 
simulation results, and Section VI concludes the paper. 


II. System Model 


The discrete-time system model shown in Fig. 2 gives 
an overview of the processing at all three nodes. Terminal 
Afi,i G {1,2}, generates a length-Tf information sequence, 
Ui = [i^i,..., u^k\- The two terminals channel-encode 

and modulate their information sequences using the function 
Ts(-), which is common to both nodes. A rate-ri turbo code 
is used, and the resulting length Ls = Kjr\ turbo codeword 
generated by Mi is denoted by = [b^i, ...bi,L s ] (not shown 
in the diagram). The signal transmitted by node Mi during 
signaling interval kT s <t< (k T- 1)T S is 


Si(t) = 


2£i 


T s 


cos 


2?r ( f Ci + 


bi,k 

tT 


(t - kT s ) 


( 1 ) 


where Si is the transmit energy, f Ci is the carrier frequency 
of node Mi (in practice, the carrier frequencies of the two 
nodes are not necessarily the same), and T s is the symbol pe¬ 
riod. Note that (1) is continuous-phase frequency-shift keying 
(CPFSK) with a unity modulation index, which is orthogonal 
under noncoherent demodulation and has a continuous phase 
transition from one symbol to the next [17]. The orthogonally- 
modulated signal Si (t) may be represented in discrete time by 
the 2 x Ls matrix = [x^i, ...,x^ s ] with k th column 


X-i,k 


1 0 
0 1 


T 

T 


if b i>k = 0 
if b lM = 1. 


( 2 ) 


For the DNC system, the signals are transmitted simulta¬ 
neously by the two source nodes over a MAC channel. The 
relay receives the noisy electromagnetic sum of interfered and 
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Fig. 2. Discrete-time system model. 


faded signals, Y, and applies the demodulation and channel¬ 
decoding function r^ 1 (*). The demodulation operation yields 
a soft estimate of the network-and-channel-coded message 
b = bi ® b 2 (not shown), while the channel-decoding op¬ 
eration yields a hard-decision on the network-coded message 
u = ui ® U 2 . With the LNC system, the two sources transmit 
during orthogonal time slots. The received versions of Xi and 
X 2 are demodulated independently to provide soft estimates 
of bi and b 2 . These soft estimates are combined and turbo 
decoded to yield a hard estimate of u. The key distinction 
between DNC and LNC is that with the DNC system, the 
estimate of b is obtained directly from Y, while with LNC it 
is found by independently demodulating the two source signals 
and then combining them. 

During the broadcast phase, the relay encodes and mod¬ 
ulates u using the function T j r(-), which may be different 
than the function Ts(-) used by the sources. The channel 
code applied by the relay is a rate-r 2 turbo code, yielding 
a length Lr = K/r 2 turbo codeword. The code rates r\ 
and v 2 used by the sources and relays, respectively, do not 
need to be the same. In the simulation results, we contemplate 
using a stronger code for the MAC phase than the broadcast 
phase, i.e. r\ < 7 * 2 . The relay broadcasts its encoded and 
modulated signal, which may be represented in discrete-time 
by the 2 x Lr matrix X. The signal traverses two independent 
fading channels, and the end nodes receive independently 
faded versions of X: Zi at A/i and Z 2 at A/ 2 . The end 
nodes demodulate and decode their received signals using 
the function T^ 1 (-), and form estimates of u. Let u denote 
the estimate at Mi and u denote the estimate at A4- Next, 
estimates of the transmitted information messages are formed, 
U 2 = u © ui at Mi and ui = u ® 112 at A/2. Since the 
links in the broadcast phase are conventional point-to-point 
links, specific details of the receiver formulation will not be 
presented here. A detailed exposition of receiver design for 
turbo-coded CPFSK systems in block fading channels can be 
found in [18]. 

All of the channels in the system are modeled as block¬ 
fading channels. A block is defined as a set of N symbols 
that all experience the same fading amplitude. The duration 
of each block corresponds roughly to the channel coherence 
time. Ideally both sources transmit with the same carrier 
frequency f Cl = f C2 . However, due to instabilities in each 
source node’s oscillator and different Doppler shifts due to 
independent motion, it is not feasible to assume that these 
two frequencies are the same at the relay receiver. At best, 
the relay receiver could lock onto one of the two frequencies, 
in which case the received phase of the other signal would 


drift from one symbol to the next. To model this behavior, 
we let the phase shift within a block vary independently from 
symbol to symbol. 

The signal matrix X^ transmitted by node Mi may be 
partitioned into 7V& = Ls/N blocks according to 






where each block X^\ 1 < / < A 5 , is a 2 x ]V matrix, and 
Nb is assumed to be an integer. The channel associated with 
block X) ; is represented by the N x N diagonal matrix 

= af ] x diag(exp{j(9^}, ..., exp{j(9^}) (4) 

where otp is a real-valued fading amplitude and ofl is the 

phase shift of the k th symbol. The {Oflf are independent and 

identically distributed over the interval [0, 2tt). The are 

normalized so that Si represents the average energy of terminal 
Mi received by the relay. The I th block at the sampled output 
of the relay receiver’s matched-filters is then 

Y W = + X^H^ + N w (5) 

where is a 2 x N noise matrix whose elements are i.i.d. 
circularly-symmetric complex Gaussian random variables with 
zero mean and variance Nq. 


III. Relay Receiver 


At the relay, each block Y^ of the channel observation 
matrix Y is passed to a channel estimator, which computes 
estimates of the a\ J and or 2 J . A full description of the esti¬ 
mator is given in Section IV. The fading-amplitude estimates 
and channel observations are used to obtain soft estimates of 
the network-and-channel-coded sequence b. The demodulator 
operates on a symbol-by-symbol basis, and therefore we 
may focus on a single signaling interval by dropping the 
dependence on the symbol interval k and the block index i. 
Let bi and 62 be the turbo-coded bits transmitted by terminals 
Mi and M 2 , and let b = b\ 062 be the corresponding network- 
coded bit. The relay demodulator computes the LLR 




P(b = l|y) 
P(b = 0|y) 


Pjh © 62 = l|y) 
P(h © b 2 = 0|y) 



where y is the corresponding column of Y. The event {bi 0 
62 = 1 } is equivalent to the union of the events {bi = 0 , 62 = 
1} and {bi = 1, 62 = 0}. Similarly, the event {bi 0 62 = 0} 
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is equivalent to the union of the events {bi = 0, b 2 = 0} and 
{61 = 1,62 = 1 }. It follows that 


A(b) = log 


P {{bi = 0 ,62 = 1 } U {61 = 1 ,62 = 0 }|y) 


log 


P ({bi = 0 , 62 = 0 } U {61 = 1 , 62 = l}|y) 

P ({61 = 0 , b_2 = 1} |y) + P ({61 = 1 , b_2 = 0 }|y) 

P ({61 = 0 , b 2 = 0 }|y) + P ({bi = 1, b 2 = l}|y) 


( 7 ) 


where the second line follows from the first because the events 
are mutually exclusive. 


A. LNC Receiver 


In the LNC system, the LLR’s of b\ and 62 are first 
computed independently during the orthogonal time slots and 
are then combined according to the rules of LLR arithmetic. 
The LLR of the signal sent from node Mi to the relay is 

a (m = k. s ^i’-= i ; y ; (8) 

V ' P(bi = 0 |y) 

where y is the signal received during the time slot that node 
Mi transmits. When the fading amplitudes ol^i — 1 , 2 , are 
known, but the phases i = 1,2, are not known, then (8) is 
found using [ 19 ] 


A (bi) 




where Jo(-) is the zeroth-order Bessel function of the first 
kind and y\ and 1/2 are the components of y. If the fading 
amplitudes are not known, but have Rayleigh distributions, 
then (8) is found using [ 19 ] 


A (bi) 


1 + / N 0 




( 10 ) 


Once the individual LLR’s from each end node are found 
using ( 9 ) or ( 10 ), the LLR of the LNC system’s network 
codeword can then be found from ( 7 ) and the independence 
of 61 and 62 when y is given: 


A (6) = log 


>A(M _|_ gA(6 2 ) 


1 _|_ e A(6i)+A(6 2 ) 
max * [A(bi), A(& 2 )] — max * [0, A(6i) + A(& 2 ) 


( 11 ) 


where ma x*[x,y] = \og(e x + e y ), 


B. PNC Receiver 

In the PNC system, it is not sensible to compute A (61) and 
A(62) separately. Instead, use ( 7 ) and assume that the four 
events are equally likely along with Bayes’ rule to obtain 

A(6)=log \p (y|{6i = 0 , b 2 = 1 }) + p (y|{&i = 1 ,b 2 = 0 })] 

- log [p (y|{6i = 0 ,b 2 = 0}) + p (y|{&i = 1 ,b 2 = 1})] • 

( 12 ) 

The computation of each p{y\ {61,62}) by the PNC relay 
receiver given various levels of channel state information is 
the subject of the remainder of this section. 


1 ) Coherent PNC Receiver: When the fading amplitudes 
and phases are known, p (y|{i>i, 62}) is conditionally Gaus¬ 
sian. The mean is a two-dimensional complex vector whose 
value depends on the values of {61, 62} and the complex fading 
coefficients {hiM2}, which are the corresponding entries of 
the H matrix. Let m [61,62] be the mean of y for the given 
values of 61 and 62. When 61 7^ 62, the two terminals transmit 
different frequencies and 


m[0,1] = 

m[l,0] = 


hi 

h 2 


h< 

h 


T 


T 


( 13 ) 


When 61 
and 


62, the two terminals transmit the same frequency 


m[0, 0] 
m[l, 1] 


{hi + 6-2) 0 ] 

0 {hi H- 6*2) ] • 


( 14 ) 


Since there is a one-to-one correspondence between the event 
{61,62} and the mean vector m[6i, 62], it is equivalent to write 
p{ y|{6i, & 2 }) as p (y|m[6 1 ,6 2 ]), where 


P (y|m[6i, 





The coherent receiver computes each of the _p (y|{61,62}) 
required by (12) by substituting the corresponding m[6i,62] 
defined by ( 13 ) and ( 14 ) into ( 15 ). 

2 ) Noncoherent PNC Receiver with CSI: Suppose that 
the receiver does not know the phases of the elements of 
the complex-valued m[6i,&2] vectors, but does know the 
magnitudes of the elements. The knowledge of the magnitudes 
constitutes a type of channel-state information (CSI). Define 
/x[6i, 62] to be the two-dimensional real vector whose elements 
are the magnitudes of the elements of the complex vector 
m[6i, 62]. When 61 7^ 62, both frequencies are used, and 


— 


1—1 


h 2 


T 

QL 1 

OL2 

— 


h 2 


1—1 

: 

T = 

OL2 

Ct 1 


i T 
T 


( 16 ) 


When 61 =62, only one frequency is used, and 

T 


m[o, 0] 
m[M] 


hi + /12 0 

0 hi + 6,2 


a 0 


T 


T 


0 a ] T ( 17 ) 


where a = \hi + /12 1 = \/ a i + + 2aict2 cos (^2 — Of). 

The pdf of y conditioned on /x[6i, 62] may be found by 
marginalizing over the unknown phases 


‘ 2 tt p 2 tt 


p{y\^[h,b 2 ]) 


p( 4 >!, 4 > 2 )p (y I m [61, b 2 ]) <701 dtp 2 • 

( 18 ) 


where <pi and fa are the phases of the first and second 
elements of m[6i,62], respectively. 

Assume that the oti are Rayleigh distributed so that the 
hi are circularly-symmetric zero-mean complex Gaussian*. 


* The receiver derived in this subsection is valid even for non-Rayleigh fad¬ 
ing, provided that the received phases over the two channels are independent 
and uniform over (0, 2n). 
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When bi ^ 62 each element of m [61,62] is a circularly - 
symmetric zero-mean complex Gaussian and therefore has 
uniform phase. On the other hand, when 61 = 62, one element 
is hi + h,2, which is the sum of two circularly-symmetric zero- 
mean complex Gaussians, while the other element is zero. 
Since the sum of two circularly-symmetric complex Gaussians 
is also a circularly-symmetric complex Gaussian, it follows 
that hi + /i2 is a zero mean circularly-symmetric complex 
Gaussian and therefore its phase is uniform. Since the other 
element is zero, its phase is irrelevant and may be set to any 
arbitrary distribution, which is most conveniently chosen to 
be uniform. Thus, it follows that </> 1 and </>2 are i.i.d. uniform. 
Therefore, the pdf conditioned on the magnitudes is 


p(y\v[h,b 2 })= 


1 


X 


ttN 0 



Vi - Pi[bi, b 2 ]e 

N 0 

V2 - 2 


3 fa 


d(p 1 


N 


#2 ( 19 ) 


0 


where ^[61,62] is the k th element of ^[61,62] and 


1 


• 27 r 


27 T 


exp 


Vk - Vk[bi,b 2 ]e 


30 k 


0 


Nt 


0 


= exp 


Dk \ 2 + (l^k[bi,b 2 ]y 


Nt 


L 


d(pk 

2\yk\Hk[biM] 


0 


0 


Nt 


0 


( 20 ) 


Substituting ( 20 ) into ( 19 ), 


p(y\n[bi,b 2 ]) = 


(3 JJexp 

fc=l 


{fik[bi,b 2 })' 


xL 


No 

2\yk\Hk[bi, b 2 ] 


o 


Nt 


( 21 ) 


o 


where 


3 


N, 


exp 


V l + 3/2 


0 


N 


o 



( 22 ) 


which is common to all four {61,62} and will therefore cancel 
in the LLR ( 12 ). 

For each event {61, 62}, substitute the _p (y|/x[6i, 62]) given 
in ( 21 ) with the ^[61,62] given by ( 16 ) and ( 17 ) as the 
corresponding p (y | {61,62}) in ( 12 ). This results in 


A(6) = log 


e -»l/N 0 T 


2a 


y 1 


0 


Ni 


0 


Nq 


e -»l/N 0I ( 2a 2\V2 

V N 0 

2«i y 2 


0 


Nt 


log 


e ~a 2 /N 0 t 


2a 


y 1 


0 


Nt 


+ e -a 2 /N 0 T 


0 


2a 


y 2 


0 


0 


N 


0 


( 23 ) 


As discussed in Section IV, it is possible to accurately esti¬ 
mate a\ and a2 in the considered block fading environment, 
provided the blocks are sufficiently long. However, it is not 
generally feasible to precisely estimate a because the phases 
61 and 62 are varying on a symbol-by-symbol basis. Since 
_£/[cos($2 — 0i)\ = 0, a reasonable approximation when an 
estimate of a is not available is to use 


a 


a? + a 2 


2* 


( 24 ) 


3 ) Noncoherent PNC Receiver without CSI: Suppose that 
besides not knowing the phases 0 \, O2 , the relay receiver does 
not know the magnitude vector /x[6i, 62]. Then, the relay must 
operate without any channel state information except for the 
average energies £1,82 and the noise variance No. When the 
magnitudes /x[6i, 62] are not known, then the conditional pdf 
is found by marginalizing (21) over the unknown magnitudes 

nOO nOO 

p(y\{bi,b 2 }) = / p(iii,ii 2 )p(y\^[b l ,b 2 \)diJi\dii 2 . 

J 0 J 0 

( 25 ) 

where fii and /12 are the magnitudes of the first and second 
elements of ^[61,62], respectively. 

According to ( 16 ), when 61 7^ 62, one of the //& = a\ while 
the other fik = ck 2- Since a\ and a2 are independent and each 
ai is Rayleigh with energy £ ? ;, it follows that the joint pdf of 
fi\ and 112 when (61,62) = (0,1) is 


p(y 1, M2) = 


2/X1 

St 


exp 



exp 


~s 2 


( 26 ) 


for ii \, fi 2 > 0, and when (61,62) = (1,0) it is 


p{y 1,^2) 


2m 

So 


exp 



exp 


T 


( 27 ) 


for Hitk2 > 0 . Substituting ( 26 ) and ( 21 ) into ( 25 ) yields 


p(y\{h = o,6 2 = 1}) = 


V 1 


N 2 


+ 


2/2 


-flog 


1 




N 2 

Nq - + N 


0 


1 


1 -1 


£182 J \£ 1 No J \£2 No 

Similarly, substituting ( 27 ) and ( 21 ) into ( 25 ) yields 


( 28 ) 


p(y|{6i = i,6 2 = 0}) = 


y 1 


N 2 


+ 


2/2 


N 2 

Nr 


-flog 


1 




( 29 ) 


£182 J \£ 1 No 

As indicated by ( 17 ), when 61 = 62, one of the fi k = a 
while the other fik = 0 . As discussed below ( 18 ), in a 
Rayleigh-fading environment, hi and /12 are independent, 
complex-valued, circularly-symmetric Gaussian variables, and 
therefore h = hi -F 6,2 is also a complex-valued, circularly - 
symmetric Gaussian variable. It follows that a = \h\ is 
Rayleigh with energy £1 +82, and the pdf of the nonzero 
ilk is 

2 (Ik 


p{pk) 


exp 


hk 


hk > 0 . ( 30 ) 


£1 £2 ( 81 82 

For the (ik — 0 , its pdf may be represented by an impulse at 
the origin, i.e. p((ik) = £(//&)• Substituting these pdfs with 
the appropriate ^[61,62] into ( 25 ) yields 

p(y|{6i,6 2 }) = log (N -.—) (N-.— + 



N, 


0 


( 31 ) 


£l +^2 
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where i — 1 when ( 61 ,& 2 ) = ( 0 , 0 ) and z = 2 when ( 61 , 62 ) = 

Substituting (28) and (29) for the two 61 7 ^ 62 and (31) for 
the two 61 = 62 into ( 12 ) yields 


A (b) = log 


-Hog 


exp 


£16" 

iN 0 _ 

\yi 


V2 


- log 


exp 


2/i 


2/2 


X 0 
2 


6 


6 


H- exp 


+ exp < - 


y 1 


2/2 


2/1 


X 0 
2 


2/2 


6 


6 


(32) 


where £1 = £ 1 + X 0 , 6 = £2 + AH and f = £1 + £ 2 + X ( 


O' 


IV. Channel Estimator 

The goal of the channel estimator is to estimate the values 
of the fading amplitudes a± and a 2 for a particular fading 
block. Let the fading amplitudes of a block be represented 
by the pair {A, B}, where A> B. Thus, A = maxjai,^} 
and B = minjaa, a^}- Note that in (23), exchanging a\ and 
0 L 2 does not change the final expression. Therefore (23) is 
commutative in ol\ and and may be written as 


m 


max * 



— max * 


F 


( 2 \J A 2 + B 2 \yi 
\ N 0 


F 


2 VA 2 + B 2 


V2 


Nt 


0 


(33) 


where the approximation a ~ has been used and 

F(x) = log[/o(x)], which may be efficiently and accurately 
computed through the following piecewise polynomial fit: 


F(x) = log[/ 0 (x)] « 

' 0.22594x 2 + 0.012495x - 0.0011272 
0.12454x 2 + 0.21758x - 0.10782 
0.028787x 2 + 0.63126x - 0.56413 
0.003012x 2 + 0.88523x - 1.2115 
< 0.00053203x 2 + 0.95304x - 1.6829 

0.00013134x 2 + 0.97674x - 2.0388 
0.9943x - 2.6446 
0.99722x - 3.0039 
k 0.99916x - 3.6114 


0 < x < 1 

1 < x < 2 

2 < x < 5 
5 < x < 15 
15 < x < 30 
30 < x < 60 
60 < x < 120 
120 < x < 500 
x > 500. 

( 34 ) 


A. Fading Amplitude Estimator 

To estimate A and B, first add the two elements of each yi 
to obtain 


n = Hi,I + Vi ,2 = h itl + hi ,2 + rii ,1 + Hi ,2 (35) 

-V-' 

Vi 

where is circularly-symmetric complex Gaussian noise with 
variance 227o, and hi^ is the channel coefficient between 
terminal A 4 , k = { 1 , 2 }? an d the relay during the i th signaling 
interval. The signal r* is the noisy sum of two complex fading 


coefficients, and therefore the fading-amplitude estimation 
algorithm proposed by Hamkins in [20] may be used. To 
determine the values of A and B, a system of two equations 
with two unknowns is required. The first equation, found by 
taking the expected value of |r ^| 2 under the assumption that 
the fading amplitudes are fixed for the block in question, is 


E 

Ti 

2 ] = E ' 



= E ' 


a 


0^2] — Oq + Oi“< 


A,2 - 0 i, 1) 

= A 2 + B 2 . (36) 


The second equation is found by conditioning on the event 
{|r | 2 > A 2 +B 2 }, which is equivalent to {cos(0 ^ 2 — 6 ^ 1 ) > 0} 
and has expected value [ 20 ] 


E 


r 


r 


> A 2 + B‘- 


= A 2 + B 2 + —. (37) 


7T 


Solving (36) and (37) for A and B yields 


^ = 1 

2 

B = - 

2 



Xf\(Y-X)fJxf\{X-Y) 


7T 


X + -{Y-X) 


7r 


X+-(X-Y) 


(38) 


where X — E [|r| 2 ] and Y — E 


r 


r 


2 > A 2 + B 2 


Since the expected values required for (38) are not known, 
they may be estimated by using the corresponding statistical 


averages, 


X = 


Y = 


1 N 
-Y 

N ^ 


\r i 


- E 

N ^ 


Vi 


(39) 


i:\ri 


>X 


where N is the size of the fading block and the factor 2/N 

^ l 19 ^ 

used to compute Y assumes that N > X for approxi¬ 
mately N/2 symbols. If this assumption is not true, then 
the multiplication by 2/N can be replaced with a division 
by the number of samples that satisfy WA > X. As an 

1 1 9 A 

alternative to summing over the N > X, Hamkins proposes 
summing over those |r ^| 2 greater than the median value of 

{All 2 , kivl 2 } [20], 

The estimator works by computing estimates X and Y using 
(39) and the {rr,r/v} for the block. These estimates are 
used in place of X and Y in (38), which yields estimates A 
and B of A and B. These estimates are then used in place of 
A and B in (33). 


B. Transmission-Case Detection 

According to (35), the two elements of are always added 
together. When b\ = 62 , only one tone is used, and the noise 
can be reduced if the receiver processes only the tone used and 
ignores the other tone. This requires that the receiver be able 
to detect whether the first tone, the second tone, or both tones 
were used, which may be implemented using a variation of the 
“no-CSI” receiver described in subsection III-B3. In [16], we 
contemplate an estimator that uses such a transmission-case 
detector. However, we found that the performances with and 
without the transmission-case detector were virtually identical 
and do not consider it further in this paper. At best, proper use 
of the transmission-case detector reduces the noise variance 
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from 2 No to TVq during the symbol intervals that both nodes 
transmit the same tone. As will be seen in the numerical 
results, the estimator is resilient enough against noise that this 
reduction in noise variance is not meaningful and does not 
justify the additional complexity. 


C. Amplitude Estimation for Single-Transmitter Links 

During the broadcast phase, there is only a single trans¬ 
mission, and the dual-amplitude estimator described in sub¬ 
section IV-A is not necessary. Similarly, the estimator is not 
needed by the LNC system during the MAC phase since the 
two transmissions are over orthogonal channels. To estimate 
the fading amplitudes for the links involving only a single 
transmitter and receiver, the simple averaging technique given 
by ( 29 ) in [ 21 ] is used, which is described as follows. Consider 
the i th signaling interval during the i th fading block. Given 
transmission of tone k, in the absence of noise, the k th 
matched-filter output at the receiver is yky = ae j6>i , and has 
magnitude \yk,i\ = ol. All other matched-filter outputs in the 
i th signaling interval are 0 . An estimate could be formed by 
taking the maximum \yry\ over any column of Yf. In the 
presence of noise, an estimate of a can be formed by averaging 
across all columns of the fading block 


a 


1 N 
-Y 

N ^ 


max 

k 


2/fe, 


( 40 ) 


V. Simulation Study 

This section presents simulated performance results for the 
relay receiver described in Section III. The simulated link 
model is as described in Section II, with specific simulation 
parameters given in the following subsections. The goal of 
the simulations is to compare the performance of comparable 
DNC and LNC systems and to assess the robustness of the 
channel estimator proposed in IV. Because the relay-broadcast 
phase of the DNC and LNC systems operate in exactly the 
same manner and have the same performance, we only focus 
on the performance of the MAC phase. 


A. Uncoded Performance with Perfect Channel Estimates 

We initially consider a system that does not use an outer 
error-correcting code, and thus b* = u^, i = 1 , 2 . We compare 
the performance of the LNC and DNC systems. With the LNC 
system, the two nodes transmit their messages in orthogonal 
time slots and the relay receiver first generates the individual 
LLR’s during each time slot using either ( 9 ) or ( 10 ), and 
then the two LLR’s are combined using ( 11 ). When there 
is no outer error-correcting code, performance using ( 9 ) is 
approximately the same as that using ( 10 ). A bit error is 
declared at the relay whenever a hard decision using (11) 
results in an erroneous decision on the corresponding bit of the 
network codeword b. Such an error will usually occur if one 
of the two bits b\ , 62 is received incorrectly, and therefore the 
error rate of the LNC system is approximately Pb 2p(l —p) 
where p is the bit error rate of noncoherent binary FSK 
modulation [ 17 ]. 

With the DNC system, the two nodes transmit simultane¬ 
ously, and the relay receiver computes the LLR using ( 23 ) 



E b / N 0 (dB) 

Fig. 3. Bit error rate at the relay in Rayleigh fading when DNC and LNC is 
used and S 2 = £ 1 . Depending on the amount of channel state information that 
is available, the PNC system will use one of three different relay receivers. 


when the magnitudes ^[61,62] are known or ( 32 ) when they 
are not. A hard decision is made on the LLR and a bit 
error is declared if the estimate of the corresponding network 
codeword bit b is incorrect. We assume that the channel 
estimates are perfect, and since there is no error-correction 
coding, the size of the fading block is irrelevant provided that 
the channel coherence time is not exceeded. 

Initially, we set the average received energy to be the same 
over both channels, i.e. £2 = £1 = £ s = £b- Fig- 3 shows the 
performance of the LNC and DNC systems in Rayleigh fading 
with equal energy signals. As anticipated, the LNC system 
offers the best performance, which is approximately 3 dB 
worse than a standard binary CPFSK system with noncoherent 
detection (the loss relative to conventional CPFSK is due to the 
fact that both bits must usually be received correctly). Three 
curves for the DNC system are shown in Fig. 3 , corresponding 
to receivers that exploit different amounts of available channel 
state information. The best performance is achieved using a 
receiver implemented with ( 23 ), which requires knowledge 
of i,<T 2, and a. The performance of the DNC system 
implemented with ( 23 ) is only about 0.25 dB worse than that 
of the LNC system. The worst performance is achieved using 
a receiver implemented using ( 32 ), which does not require 
knowledge of the fading amplitudes. The loss due to using 
( 32 ) instead of ( 23 ) is about 10 dB, indicating that estimating 
the fading amplitudes at the relay is necessary. 

While it may be feasible to estimate a\ and <22» estimating 
a may prove to be more difficult because it will depend on 
not only the individual fading amplitudes, but also on the 
phase difference between the two channels. Since the phase 
difference might change more quickly than the individual 
amplitudes, it might not be practical to estimate a. If that 
is the case, then the approximation given by ( 24 ) can be used 
in place of the actual value of a. The performance using this 
technique is also shown in Fig. 3 and shows a loss of about 3 
dB with respect to the known-/x[6i, 62] system, which requires 
knowledge of a. 

The performance of DNC is sensitive to the balance of 












229 



E b /N 0 (dB) 


Fig. 4 . Bit error rate at the relay in Rayleigh fading of DNC with three 
different receivers and either S 2 = £\ (solid line) or £2 = 4£i (dashed 
line). 

power received over the two channels. Performance is best 
when Si = £2- In order to evaluate how robust the DNC relay 
receivers are to an imbalance of power, the simulations were 
repeated with £2 = 4 £i, while keeping £5 = £ s = (£i+£2)/2. 
These results are shown in Fig. 4 for the three receiver 
formulations that were considered in the previous figure. 
When the power is imbalanced in this way, there is a loss 
of about 2 dB. However, the loss is the same for all three 
receiver implementations, suggesting that they are robust to 
an imbalance of power. 

B. Uncoded Performance with Channel Estimation 

We now consider the influence of channel estimation, but 
still assume that the system does not use error-correction 
coding. In the simulations, the information frames generated 
at the end nodes contain K = 2048 bits per frame. The fading 
blocks are length N = {8, 32 , 128 } symbols per block. The 
DNC relay implements ( 32 ) and then makes a hard decision 
on each information bit. 

The bit error-rate performance of the uncoded system is 
shown in Fig. 5 . The performance is shown with the estimator 
using the three block sizes N = {8, 32 , 128 } as well as for 
the case of perfect estimates of aq and a^. A narrow range 
of error rates is shown to better highlight the differences in 
performance. In general, smaller fading blocks lead to a less 
accurate estimation of the fading amplitudes, as the number 
of samples available for estimation decreases. Moving from 
block size N = 128 to 32 worsens performance by roughly 
0.25 dB, and from TV = 32 to 8 by 0.75 dB. 

C. Performance with an Outer Turbo Code 

Now consider a system that uses an outer turbo code. 
The terminals each encode length K = 1229 information 
sequences into length L = 2048 codewords, using a rate 
r\ ~ 0.6 UMTS turbo code [ 22 ]. The relay performs turbo 
decoding using the codeword LLR’s computed by ( 32 ). The 



Fig. 5. Influence of fading-block length N on uncoded DNC error-rate 
performance at the relay. In addition to curves for three values of N, a curve 
is shown indicating the performance with perfect fading-amplitude knowledge. 



Fig. 6. Influence of fading-block length N on turbo-coded DNC error- 
rate performance at the relay. Two curves are shown for each value of 
N = {8, 16, 32, 64, 128}. Solid curves denote perfect fading-amplitude 
knowledge. Dashed curves denote estimated fading amplitudes. 

fading-block lengths simulated are N = { 8 , 16 , 32 , 64 , 128 } 
symbols per block. 

The error performance of the coded system is shown in 
Fig. 6, both with perfect channel estimates and with estimated 
fading amplitudes. A good tradeoff between diversity and 
estimation accuracy is achieved for block sizes N = 16 
and N = 32 , which exhibit the best performance of all 
systems that must estimate the fading amplitudes. For N < 16 
performance degrades due to the lack of enough observations 
per block for accurate channel estimates, while for N > 32 
performance degrades due to the reduction in time diversity. 

Fig. 7 shows the SNR required to reach an error rate of 
10 _4 at the relay as a function of the block length N. In each 
case, information is coded with the same ( 2048 , 1229 ) turbo 
code used for Fig. 6. Curves for three systems are shown: The 
noncoherent receiver with known {aq,^}, the noncoherent 
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Fig. 7. Signal-to-noise ratio required to reach a bit error rate of 10“ 4 at the 
relay as a function of fading-block length. The performance of three systems 
is shown: The noncoherent receiver with known { 011 , 0 : 2 }, the noncoherent 
receiver with estimated { 01 , 02 }, and the noncoherent receiver that does not 
use CSI. All systems use a Turbo code with rate 1229/2048. 



Fig. 8. Comparison of error-rate performance between the turbo-coded DNC 
and LNC systems at the relay. The solid lines denote DNC, while the dashed 
lines denote LNC. 

receiver with estimated {cti, 0 : 2 }, and the noncoherent receiver 
that does not use CSI. When {cti,ct 2 } are not estimated, 
performance improves with decreasing N because of the 
increased number of blocks per codeword, which increases 
the time diversity. However, when { 0 : 1 , 112 } are estimated, 
the performance gets worse when the block size is smaller 
than N = 16. The loss of time diversity as the block size 
increases is a common problem for any system operating over 
a slow-fading channel, and the system proposed in this paper 
is no exception. The performance gap between the known- 
CSI and no-CSI receiver formulations widens with increasing 
block length. 

An error-rate performance comparison between DNC and 
LNC is shown in Fig. 8. Both systems use the same 
(2048,1229) turbo code. The LNC system outperforms the 



Fig. 9. Comparison of the performance of turbo-coded DNC and LNC at 
the relay with block size IV = 32. For the DNC system, two code rates are 
shown, with the lower rate code offering comparable performance to the LNC 
system. 

DNC system by margins ranging between 4 and 6 dB. 

While the LNC system is more energy efficient than the 
DNC system when the same-rate turbo code is used, the 
throughput of the LNC system is worse than that of the DNC 
system because the two terminals must transmit in orthogonal 
time slots. The loss in energy efficiency from using DNC ver¬ 
sus LNC can be recovered by having the source terminals use 
a lower-rate turbo code. Consider the performance comparison 
shown in Fig. 9 for block size N = 32. At E^/Nq « 24 dB, 
DNC using a rate r\ = 4500/6400 code matches the error- 
rate performance of LNC using a rate r\ = 4500/5056 code. 
Because the two terminals transmit at the same time, the end- 
to-end throughput of DNC is higher than that of LNC, even 
though the DNC terminals transmit to the relay with a lower- 
rate channel code. 

To illustrate the throughput improvement of DNC over 
LNC, consider the following transmission schedule for the 
two systems. Assume the source terminals use rate r\ = 
4500/6400 in DNC, and n = 4500/5056 in LNC. Assume 
operation at E^/Nq = 24 dB, yielding approximately equal 
relay error-rate performance. Further, assume that both sys¬ 
tems use code rate r *2 = 4500/5056 for relay broadcast, 
yielding approximately equal end-to-end performance. DNC 
requires 6400 channel uses for transmission to the relay versus 
2 x 5056 = 10112 for LNC. Both systems require 5056 
channel uses for relay broadcast. The throughput for DNC 
is thus t( dnc ^ = 9000/(6400 + 5056) = 9000/11,456 bits 
per channel use, and for LNC t( lnc ^ = 9000/(3 x 5056) = 
9000/15,168 bits per channel use. The percentage throughput 
increase of DNC over LNC is thus () /j'iLNC) _ 1 ) x 
100 « 32.4%. 

VI. Conclusion 

A throughput-improving technique for relaying in the two- 
way relay network, digital network coding , is refined for prac¬ 
tical operation. The system operates noncoherently, providing 
advantages over coherent operation: there are no requirements 
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for perfect power control, phase synchronism, or estimates of 
carrier-phase offset. 

A computationally simple technique for estimating fading 
amplitudes at the relay is implemented. Error-rate performance 
in the noncoherent Rayleigh block-fading channel at several 
block sizes is presented. The system is simulated with and 
without an outer error-correcting code. The coded error-rate 
performance of the system using estimation differs from that 
with ideal estimates by margins between 0.7 — 1.5 dB. 

When the same-rate turbo code is used, digital network 
coding has a higher throughput but lower energy-efficiency 
than link-layer network coding . The energy loss of DNC can 
be recovered by using a lower-rate turbo code during the MAC 
phase. Even when the loss of spectral efficiency due to the 
lower-rate turbo code is taken into account, the DNC system 
is able to achieve a higher throughput than LNC at the same 
energy-efficiency. In the particular example presented in this 
paper, the DNC system is capable of achieving throughputs 
that are 32.4% larger than that of the equivalent LNC system, 
while operating at the same energy efficiency. 
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We study the transport of information between two complex systems with similar properties. Both systems 
generate non-Poisson renewal fluctuations with a power-law spectrum l// 3-/ \ the case p = 2 corresponding to 
ideal 1 // noise. We denote by /i s and p P the power-law indexes of the system of interest S and the perturbing 
system P, respectively. By adopting a generalized fluctuation-dissipation theorem (FDT) we show that the ideal 
condition of 1// noise for both systems corresponds to maximal information transport. We prove that to make 
the system S respond when fi s < 2 we have to set the condition pi P < 2. In the latter case, if p P < fi s , the 
system S inherits the relaxation properties of the perturbing system. In the case where pt P > 2, no response 
and no information transmission occurs in the long-time limit. We consider two possible generalizations of 
the fluctuation dissipation theorem and show that both lead to maximal information transport in the condition 
of 1 // noise. 
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I. INTRODUCTION 

Linear response theory (LRT) [1] is one of the basic ways 
of obtaining information from fluctuations in nonequilibrium 
statistical physics [2,3] that is currently adopted to address new 
phenomena such as glassy systems [2] and granular matter [3]. 
An even more challenging issue is the application of LRT 
to complex processes such as physiological processes and 
especially the understanding of brain dynamics. 

It is becoming more widely accepted that the brain operates 
at criticality [4,5] and that the critical condition of a phase 
transition has manifestations that extend beyond the conven¬ 
tional condition of temperature driven systems [6] . Frantsuzov 
et al. [7] adopted a model of cooperatively interacting units to 
propose a solution to the long-standing mystery of the origin 
of the power-law distribution of the blinking times in colloidal 
quantum dot fluorescence. On the other hand, this form of 
intermittence is characterized by the condition of renewal 
aging [8] and by the consequent ergodicity breakdown [9] that 
makes it impossible to use conventional LRT. We refer to these 
systems as complex systems. It is now very well understood 
[10] that the breakdown of the ergodic condition is caused by 
the occurrence of crucial events. It is interesting to remark 
that the same perspective applies to complex networks, when 
the cooperation-induced phase transition produces temporal 
complexity and crucial events as illustrated in in the work of 
Ref. [11]. These events are renewal, namely, the time interval 
between two consecutive events does not have any relation 
whatsoever with the earlier or later time intervals between 
two consecutive events. Yet, if a Gibbs ensemble is suitably 
prepared initially, namely, in all the systems of the ensemble an 
event occurs at the time origin, then the rate of event production 
turns out to be time dependent rather than constant as in the 
ordinary Poisson case. The authors of Refs. [12,13] applied 
this theoretical perspective to the liquid crystal dynamics and 
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experimentally realized a true cascade of renewal events. The 
time interval between two consecutive crucial events is given 
by a waiting times probability density function (PDF) \f{x) 
with the following asymptotic form 


x//(r) = (ji 



J/X-l 

(r + Tf - 1 


5 


and the power-law index pt fulfilling the inequality 



1 < pt < 3. (2) 

It is important to explain the origin of the special form of 
Eq. (1). First of all we want to stress that according to a point 
of view in the field of complexity, only the asymptotic time 
behavior matters, namely, ^r(r) oc l/r M . The adoption of this 
widely shared point of view, as we shall see hereby, would 
prevent us from establishing a correct accordance with the 
experiments on the response of complex systems to external 
perturbations. Thus, the choice of Eq. (1) is dictated by the 
need for defining a border between the asymptotic time regime 
(r T) and the microscopic time regime (r ^ T). Given the 
neurophysiology interest of this paper and especially the focus 
on brain dynamics, we refer the interested reader to the work of 
Ref. [14], where the waiting times PDF of Eq. (1) is obtained by 
means of a Fechner transformation [14] from the conventional 
Poisson distribution. 

The rate of cascade of renewal events tends to a vanishing 
value as l/t 2 _/x when pt < 2 and to a constant value as \/t^~ 2 
when pt > 2. It is evident that in both cases the time duration 
of the out-of-equilibrium condition is infinite, thereby raising 
the challenging task of going beyond conventional LRT to 
describe the dynamics. 

Conventional LRT is given by the following expression: 

o(t) = <ts(0> = € f dsx(t,s)%p(s), (3) 

Jo 

where %s(t) is the fluctuation produced by the system of 
interest S. The symbol (^(0) denotes the Gibbs average 
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over the fluctuations. In the absence of perturbation this 
average is assumed to vanish. The variable %p(t) denotes the 
time-dependent perturbation and € its intensity. LRT predicts 
the response of S on the basis of the unperturbed correlation 
function of In fact, the function x(t,s ), called the linear 
response function, is related to the correlation function of the 
fluctuation § 5 , whose quadratic mean value is assumed for 
simplicity to be normalized to unity 

*s(t,s)=($s(0$s(s)), (4) 

by the following expression 

d 

X(t,s) = (5) 

ds 

Note that the traditional LRT refers to the stationary case 

'I's(M) = 'I'sO - s), (6) 

and as a consequence 

= --j-'I's(M). (7) 

ds dt 

This condition is not fulfilled by complex systems. For these 
latter systems the choices of linear response functions 

d 

X(t,s) = —y s (t,s), (8) 

ds 

and 

X(t,s) = (9) 

dt 

are not equivalent. 

The authors of Refs. [15-18] have discussed the foundation 
of both choices and have established that the new form of LRT 
is determined by the physical way through which perturbation 
determines a bias. For the sake of simplicity these authors have 
made the assumption that %s(t) is a dichotomous signal. Using 
the jargon of turbulence theory they called the time intervals 
between two consecutive crucial events laminar regions. At the 
moment of a crucial event occurrence, unperturbed dynamics 
are realized by the random selection of either the positive, 

= 1, or the negative value, = — 1. In other words, they 
assume that the occurrence of a crucial event generates the 
tossing of a fair coin which determines the sign of the next 
laminar region. Consequently, the external perturbation can 
generate a bias in two different ways. The first way rests on 
affecting the fairness of the coin tossing process. If %p(t) > 0 
($p (0 < 0 ) the choice of the positive (negative) sign is more 
probable than the choice of the negative (positive) sign. This 
prescription leads to the choice of Eq. ( 8 ) and is denoted as 
phenomenological LRT. 

The experiments done by the authors of Refs. [12,13] show 
that nature prefers Eq. (9), the dynamic LRT. What is the 
theoretical argument in favor of the dynamical theory? To 
afford a convincing answer to this important question, let us 
go back to the special form of Eq. (1). We note that we do 
not know the Hamiltonian of our complex system, and we 
do not even know if a satisfactory discussion of the complex 
dynamics can be made using a Hamiltonian formalism. Let 
us assume that Eq. (1) is a reliable representation for the 
distribution length of the laminar region. In this case, a 


reasonable conjecture is that the external perturbation affects 
either /x or T, or both parameters defining the form of Eq. (1). 
We know that /x defines the system’s complexity and emerges 
from the cooperative interaction among interacting units. A 
weak external perturbation is not expected to change the 
system’s complexity. It is therefore reasonable to assume that 
the external perturbation affects T, by enlarging (reducing) 
its value if and §$ have the same (opposite) sign. This 
assumption leads to the choice of Eq. (9), as shown earlier 
[15-18]. 

It is important to notice that the response of a complex 
system of the same nature as the one discussed in this paper 
has been studied by many authors [19-28]. These authors did 
not establish a connection between their results and the LRT 
of Refs. [15-18] and in some cases they made the misleading 
conjecture that their results establish the “death of LRT.” 
Actually, these theoretical treatments are asymptotic in time 
and the only possible connection with LRT is through the 
adoption of the phenomenological theory of Eq. ( 8 ), as the 
readers can establish by a careful reading of Ref. [28]. 

We are now in a position to define the main purpose of 
this paper. We draw the attention of the readers to the recent 
results of the authors Ref. [29]. This paper addresses the 
important issue of studying the response of a complex system 
to a complex external perturbation with the surprising result 
that a complex system does not respond to stimuli that are not 
complex (i.e., that have a stationary Fourier spectrum). It is 
important to stress that the authors of Ref. [29] focused on 
the correlation between % s (t) and %p(t) in the long-time limit. 
This is an ideal condition that has the effect of restraining the 
definition of complexity to the systems with /x ^ 2. In fact, in 
the long-time limit a system with /x > 2 reaches the normal 
condition of a constant rate of event production, thereby 
recovering the ordinary Poisson condition. The condition 
/x = 2 is of fundamental importance for brain function. In 
fact, recent work [30,31] established that the brain works 
with /x = 2, which, in turn, is known [32] to correspond to 
making the brain action become the source of ideal 1 // noise. 
The results of the authors of Ref. [29] may therefore have 
important applications to design the most convenient stimuli 
to drive complex networks, and especially brain dynamics, via 
what was defined as “complexity management” [29] . However, 
an apparent weakness found in Ref. [29] is that these results 
are derived from the adoption of the phenomenological LRT, 
thereby raising doubt that those complex systems, which 
have been proven to obey the dynamical LRT [12,13], may 
not obey the principle of complexity management (CM) 
established in Ref. [29]. Herein we prove that the more 
realistic dynamical LRT generates CM. In addition to this 
main purpose, the present paper affords technical details on 
the theory developed in Ref. [29] that, due to space limitations, 
were not conveniently illustrated. 

II. FDT FOR THE NONERGODIC RENEWAL REGIME: 

PHENOMENOLOGICAL AND DYNAMICAL 

APPROACH 

The authors of Refs. [15,16] discovered a form of FDT that 
applies to systems or networks whose dynamics are dominated 
by non-Poisson renewal events. In the stationary case this 
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FDT becomes indistinguishable from the ordinary theoretical 
prediction [1]. 

Herein we investigate the consequences of the adoption 
of either the “phenomenological” choice Eq. (8) or the 
“dynamical” choice (9), in the special case where both § P (t ) 
and i;s(t) are event-dominated processes and show that the 
transmission of information from P to S is determined by the 
dialogue between the critical events of and the critical 
events of %p(t). Specifically, this discussion is devoted to 
studying the transport of information from P to S, using 
both forms of generalized FDT (gFDT). Note that there is 
no limitation on the form of provided that the coupling is 
weak enough as to be compatible with the emergence of the 
linear response form of Eq. (3). 

To discuss the problem of the transmission of information 
from P to S, it is convenient to imagine the ideal case of a Gibbs 
ensemble of composite systems S + P. Thus, in this paper we 
imagine that P generates a fluctuating signal %p(t) and that 
for any signal %p(t) there exists a response %s(t). For each 
perturbing signal we have to make infinitely many experiments 
and average over all possible responses. For simplicity we take 
both signals %s(t) and %p(t) to be dichotomous and fluctuating 
between values ±1. It is important to remark that Eq. (3), for 
the response of the system S to the perturbation P, is valid 
when the system is prepared at time t = 0 and placed at the 
beginning of a laminar phase, and the interaction with the 
perturbation P is turned on at the same time. 

In the general case of a dicothomous renewal process §(f), 
generated with a waiting-times PDF the probability 

density that fixed a time t\ the first next event is observed 
at time t > t' is given [33] by 


00 nt' 

= VKO + V / - t")dt\ (10) 

n= 1 Jo 

with 

oo 

/?(o=y>„(o, (id 

«=i 

where xjf n (t) denotes the n -times convolution of R(t) is 
therefore the probability density of having an event occurring 
exactly at time t. 

It can be shown as well, see Ref. [33], that the autocorrela¬ 
tion function of the process is related to by 


<$( 0 $(*')> 





( 12 ) 


and therefore coincides with the survival probability T'(cE) 
for the first event (i.e., the probability that, for fixed t\ no event 
is observed until time t > t r ). We assume that the fluctuation 
%s(t) generated by S’ is a dicothomous renewal process defined 
by the probability density 


= (ms 


T s ^ s ~ l 

I + T S Y' ’ 


(13) 


We therefore name, respectively, ^(t,F), Rs(t), and ^s(t,t') 
the functions obtained by replacing in Eqs. (10), (11), and (12) 
\J/(t) with fsit). 

Let us consider now the Gibbs ensemble of composite 
systems S + P, and evaluate the average (§s(0) sp- Note that 


the average is over the separate statistics of the two systems S 
and P 


(§s( 0 )sp = ({%s(t))s)p- (14) 


We select all the responses to the same perturbation, charac¬ 
terized by a given %p(t), we evaluate their average, denoted by 
(§s(0)s, and finally we construct the average over all possible 
perturbations denoted by {...)p so as to obtain the final result 
denoted by (...) sp . In conclusion with this procedure we obtain 

<*(0> = «&(0» = * [ dt'x(t,t')($ P (t')), (15) 

Jo 

where, for notational convenience, we drop the subscripts, but 
we understand the averages in the sense described above. 

If necessary, the signal %p(t) must share the same properties 
as %s(t) and for simplicity they are both assumed to be di¬ 
chotomous signals with random renewal fluctuations between 
the values +1 and —1. %p(t) is therefore a non-Poissonian 
dichotomic fluctuation with the following waiting-time PDF: 


tMO = (Mp 



T^p- i 

It + T P yr ’ 


(16) 


It is therefore convenient to define the additional functions 
x/fp(t,t '), Pp(t), and typ(t,t') obtained, analogously as for S , 
from Eqs. (10), (11), and (12) after replacing ty(t) with the 
waiting-time distribution x/r P (t ). 

The spectrum of this type of fluctuating signal, in the 
absence of perturbation, as calculated in Refs. [32,34], is 


S(f) oc L^- 2 r~\ (17) 

valid for /x < 2, remarkably, even though a stationary auto¬ 
correlation function cannot be defined in this case. In the case 
fi > 2, S(f) = A/f 3-/x , with A independent of L, the length 
of the sequence under study. 

At this point it should be clear to the reader that to get 
the important results of this paper on the transmission of the 
statistical properties of P to S we must use Eq. (15). This 
leads us to give a prescription to define (§>(t)). We assume 
that the perturbation P is prepared at t = 0 at the beginning of 
a laminar phase (i.e., with the analogous prescription adopted 
for the system S ). This assumption allows us to replace (§>(F)) 
with ^ p{t') in Eq. (15). 


III. PHENOMENOLOGICAL APPROACH 

In this section we study the response of a complex system 
producing non-Poissonian renewal fluctuations to a perturbing 
signal with similar properties within the phenomenological 
approach. We analyze both the average response and the 
input-output correlation [i.e., the correlation between the 
perturbing fluctuating signal (input) and the signal produced 
by the system of interest S (output)]. In the phenomenological 
approach, the waiting times between the events generating the 
dichotomic fluctuations remain unchanged by the perturbation. 
The external perturbation introduces a bias so that when 
an event occurs the probability that the dichotomic variable 
changes or keeps its value is slightly different. The function 
X(t,s) in this approach is given by [15,16] 

X(td') = = Rs(t')V s (t -1'). (18) 
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The function ^s(t,t') is the autocorrelation function of 
namely, the survival probability of age t', and Rs(t) for the 
case of the discrete signals considered here, is the rate at which 
events are produced by S prepared at t = 0 [i.e., the bits per 
second encoded in This rate is time independent only 

in the Poisson case. In the non-Poisson case it depends on 
time, thereby making ^s(t,t') nonstationary. The brand new 
survival probability 4^(0 = 4U(M /= 0) is given by [15,16,18] 

*s(t) = a+t/Ts) 1 -'* s , (19) 

from which the corresponding waiting-times PDF i/ssiO = 
— d 4f$ (t)/dt is derived. In the range of parameters 1 < /is < 3 
considered here, it is known [10] that 

sin7r/xc 0 „ 

Rs(t) ~- -=—(Jsl for 1 < < 2, (20) 

Is 

Rs(t)* -[l+(T s /tr s ~ 2 l for 2 < < 3, (21) 

with r s = Ts/(iis — 2) the mean value of V'sCO- 

When /is < 2 the experimental preparation of S induces 
a sequence of events whose rate Rs tends to vanish for t —> 
oo, yielding a perennial out-of-equilibrium condition, and an 
explanation of the death of linear response [18-24,28] as well. 
In fact, the response to a harmonic perturbation of frequency 
/ is proportional to \/{ft) 2 ~^ s [18]. In the case 2 < /is < 3, 
on the contrary, the preparation-induced cascade of events, in 
the limit t —> oo, becomes stationary and virtually identical to 
that of a Poisson process. The theoretical analysis of this paper 
is done in the asymptotic time regime. Thus, we refer to the 
case 2 < /x < 3 as stationary , in contrast to the nonstationary 
case /x ^ 2 of perennial transition. Similarly to the rate of 
events Rs(t) the spectral intensity per unit time tends to vanish 
for /a < 2 as an effect of increasing L [see Eq. (17)]. The 
ideal 1 //-noise condition, corresponding to /x = 2, generates 
instead a logarithmic decrease of the spectral intensity with 
time, and consequently a spectrum virtually independent of L. 


A. Average response to perturbation 

As previously mentioned, the nonstationary LRT (NSLRT) 
of Eq. (3) rests on the preparation of S at time t = 0. We apply 
the same preparation condition to P, thereby generating the 
cascades Rs(t) and Rp(t) described by Eqs. (20) and (21), with 
the appropriate indexing. Under this condition the relaxation 
of (§>/)) becomes identical to the survival probability 4^ ( t ). 
Assuming the condition of Eq. (18) we have the following 
expression for the average response 

(a(t)) = ^ [ Rs(t')V s (t - t')V P (t')dt'. ( 22 ) 

Jo 

The preparation of both S and P makes the average over many 
realizations of the response o{t) to a given stimulus P vanish 
for t -> oo. 

Stationary case: 2 < /as < 3. 

In this regime a finite time scale for the fluctuation §$ exists 
and Rs(t ) reaches the constant value 1/r s. The inverse power- 


law relaxation of 4/^(0 allows us to approximate Eq. (22) by 
replacing Rs(t') with its value for t' — t , that is, 

(cr(t)) - 6 [ dt'V s (t ~ t')V P (t')/T S , (23) 

Jo 

which becomes exact for t —>► oo. The asymptotic behavior of 
Eq. (23) is easily obtained in the Laplace domain 


^ 1 1 - fis(s) 1 - $p(s) 

(cr(s)} ~ e- 


(24) 


T S S S 

which can be studied in the limit of small v. In fact, since [35] 
\j/(s) — 1 — rs + T(/x — 2)s /1 ~ l , /x > 2, (25) 


and 


ir(s) — 1 + T(/x — 2 )s fl 1 , /x < 2 , (26) 


it follows that for 1 < /jl p < 2 and 2 < /jl p < /x^, the time- 
asymptotic behavior is {cr(t)) ~ which is proportional 

to (§>/)) for large t , meaning that the system S “inherits” the 
relaxation properties of the perturbation P . 

For 2 < fis < lip, instead, the asymptotic dominant term 
is (cr( 0 ) ~ t l ~ fls , which is proportional to the ordinary 
unperturbed relaxation to equilibrium (§ 5 /)), when an initial 
bias for ^$(0 is introduced. We see therefore that for /x^ < /x^, 
when %p(t) is slower than §s(t), the perturbation P imposes 
on the system S its own relaxation properties thereby allowing 
one to “manage” the complexity of a system or network by 
using an appropriate stimulus. 

Nonstationary case: 1 < /as ^ 2. 

In this regime, the system S violates the finite-time 
scale condition necessary for stationary dynamics and, in 
fact, Rs(t) oc t^ s ~ 2 , see Eq. (20). With such replacement in 
Eq. (22), a convolution form appears which can easily studied 
via a Laplace transformation. In the Laplace domain (see 
Appendix A for details on coefficients) 

(a(s ~ 0)) ~ a s s /Js ~ 2 + a P s^ p ~ 2 , (27) 


which implies that if/x/> > 2 or if 1 < /is < fip then (cr( 0 ) ~ 
t l ~^ s . When 1 < /up < fas, we have {cr(t)) ~ t l ~^ p . Also in 
this case the perturbation P forces onto S its own relaxation 
properties to equilibrium. 


B. Input-output correlation function 

We study the cross-correlation (or input-output correlation) 
function between the system S and the stimulus P : C(t) = 
((§s (t) ( 0 )), which is also used as an indicator of aperiodic 

stochastic resonance [36]. Multiplying both sides of Eq. (3) by 
§ p(t ) and averaging over the fluctuations of the perturbation 
P we obtain 

Q(t) = C(t)/e= [ dt'R s (t')V s (t-t')Vp(t,t'). (28) 

Jo 

Note that both Eqs. (22) and (28) depend on the survival 
probability of P, but in the former such survival probability 
depends on the single time t' whereas in the latter it depends 
on both t' and t. We limit ourselves to report the results for 
the asymptotic value Ooo of O(t). When ^$(0 and § P (t ) are 
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not stationary, that is, when 1 < /i s ^ 2 and 1 < /ip ^ 2 , 
Eq. (28), in the limit t —> oo, gives 

= £(Ts,Tp) = T(/is + lip ~ 2) 

3F2 [{/Ip — l,lip — l,lip + fls — 2 },{flp,llp},l] 

X r (2 - iip)T{iipYT{iis - 1) ’ 

(29) 

where 3 F 2 is the generalized hypergeometric function. For 
more details see Appendix A. In the case 2 < /ip <3, Ooo is 
simply zero. 

In the case 2 < /is < 3, inserting into Eq. (28) expression 
(21) for R s (t ), leads to 

/•* \pc(^ - E) 

0 ( 0 = / --Op(t, 0 ) = o 5 ( 0 )Op(t,o 

Jo *s 

-VsWp(f,0)- p d y \ (30) 

where 0^(0 is given by Eq. (19) after replacing /is with 
lis — 1. Equation (30) is exact for t Xs and for 1 < /xp ^ 2 
it leads to Ooo = 1 since both the second and third terms 
disappear for t —> 00 and the first is trivially 1 in the same 
limit. For Eq. (30) 2 < /i P < 3 it yields 



FIG. 1. (Color online) The asymptotic limit of 0(0 is displayed 
for /is,lip e]l,3[. The vertex /i s = /i P = 2 marks the transition to a 
condition of maximal input-output cross correlation. 

the abrupt transition from vanishing (square II) to maximal 
correlation (III). 


= 1 - (up - 2)T'p P 2 T^ s 2 A 


xB 


AT/Tp . 2 — /ip 

A T/T s ,fls + Vp ~ 4 ’ 3 _ 



0 


[is — 2 

Tp + /is ~ 4 


(31) 


where AT = \Ts — T P \, B[x,a,b ] is the incomplete Beta 
function and the upper (lower) choice of the parameters refers 
to the case T s > ( <)T P . The final expression in Eq. (31) 
corresponds to the case Ts = Tp. The results are summarized 
in Table I. For illustrative purposes, we supplement Table I with 
Fig. 1, showing the three-dimensional (3D) plot of the cross¬ 
correlation function in the same parameter range: Square 
II and square III correspond to the condition of minimal and 
maximal correlation, respectively. Intuitively it is so because of 
the difference of time scales between S and P in such regions. 
In III fluctuations %s(t) and % P (t) have a finite and an infinite 
time scale, respectively, thereby allowing %s(t) to adapt to the 
stimulus-induced bias so as to yield maximal correlation. In 
II the role of the time scales is inverted, the bias induced by 
P on the longer (diverging) time scale of the process %s(t) 
is asymptotically averaged out due to the many intervening 
switching events of § P (t ), producing no correlation. The 
vertex /i s = /i P = 2 , representing a 1 //-noise generating 
system under the stimulus of a 1 //-noise perturbation, marks 


TABLE I. Summary of the asymptotic values of the cross¬ 
correlation function <$>/) in the phenomenological case. 


\ip- 


1 < lip ^ 2 


2 < fip < 3 


1 </x 5 ^2 

2</jL S <3 


^00 — £(/iSi/ip) I 

Ooo = 1 III 


Ooo =0 


US-2 


ps+pp-4 


II 

IV 


See Eq. (29). 


IV. DYNAMICAL APPROACH 

In this section we extend the analysis of the previous section 
to the dynamical approach. Within such approach we derive 
both the average response and the input-output correlation 
function. 


A. Average response to perturbation 

Starting from the property 


(<r(0> = «&(*)», (32) 


obtained by averaging over the fluctuations of both S and P 
and using the dynamical condition of Eq. (9), one obtains 

(33) 

at 

Equation (3) then becomes 

(o{t))=e f dt'1r s (t 9 t')($p(t% (34) 

Jo 

For simplicity we prepare the perturbation §>/) at t = 0. 
To observe the influence of P on S we select all the systems 
of the Gibbs ensemble so that §>(0) = 1. In this case (§ P (t ')) 
is given by the survival probability 4> P (t ) [33]. Thus, Eq. (34) 
yields 

(a(0) = € [ dt'\lr s (t,t')V P (t'), (35) 

Jo 


where ^p{t f ) is the survival probability for the process % p (t) 


Vp(t) 



dx^p(x) = 


T P 


t + T P 


fip—i 


(36) 


This slow decay corresponds to the probability that no 
perturbation event occurs up to time t. The system S evolves 
in time so as to reach a steady value that corresponds to 
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a constant perturbation abruptly applied at t = 0. However, 
during this process a perturbation event occurs that has the 
effect of suddenly changing the external field. Thus, o(t) does 
not reach a steady value, but after reaching a maximum value 
will decay. Under the specific conditions discussed in this 
section, the time asymptotic decay of {cr(t)) in Eq. (35) has 
the same power-law index as that of survival probability TV(t). 
We interpret this phenomenon as the transmission into S of the 
statistics of P . 

We explore again the whole range of parameters 1 < /i s < 
3 and 1 < /ip < 3, respectively, for the system S and the 
perturbation P, depending on the values of the power-law 
indexes /is, /ip characterizing their waiting-times PDFs. The 
value /i = 2 marks the transition from a finite to an infinite 
mean time (i.e., the transition to a nonergodic, nonstationary 
condition. In fact, while for /i > 2 a mean time exists, a finite 
time scale can be defined and a stationary condition is reached, 
for /i < 2 such condition is never achieved, not even in the 
infinite mean time. 

Stationary case: 2 < /is < 3. 

In this regime the waiting-times PDF i//s(t) has a finite 
mean value is. Therefore a finite time scale tc exists such that 
for t > t' > tc the following approximation corresponding to 
reaching a stationary condition, is valid [33] 

1 f°° 4 fc(t - t') 

MS) ~ - / dxM - t') = —-(37) 

Jt Ts 

where Xs = Ts/(/is — 2) is the mean value of V'sCO- Equa¬ 
tion (37) is exact for tc -> oo. We can also use Eq. (3) for 
the response in the case when the interaction is turned on at 
a later time tc that we assume to be so large as to satisfy 
the approximation Eq. (37) using the following procedure. 
We introduce into Eq. (3) an effective perturbation which is 
turned on at time t = 0 but is zero until tc [i.e., §£ ff (f) = 
0(t — tc)%p(t — tc)]. In this scheme we can assume that the 
“real” process %p(t) is prepared at time t = tc in a brand new 
condition. Time tc therefore corresponds to the age of the 
system S. The average response {cr(t)) of S in this case reads 

(or(O) = ^ [ dt'MSmt' - t C )M' ~ tc)) 

Jo 

= S \ dt f \lrs(tSyVp(t f - t c ), (38) 

J tc 


and the approximation (37) can be used to replace \//s(t,t'). 
With the substitution r = t' — tc and after renaming t — tc a s 
t back again, the average response of the system S of age tc 
reads 

(cr(t)) ^sit-x) 

t c = / dx ——- -^ P (x) = ^s(0)^ P (t) 

£ Jo Ts 

-Vs(t)Vp( 0 )- f dt'M-t')4-v P (t'), 

Jo dt f 

(39) 


where Tfs(t) is defined as 


y s (t) = (l+t/T s r s - 2 . (40) 

Equation (39) is exact for tc —> oo and coincides with the 
expression (23) obtained in the phenomenological case in the 
same regime. Therefore, in the limit t Ts,T P , the same 
considerations apply, that is, if 1 < /i P < 2 and 2 < /jl s < Pp, 
the time-asymptotic behavior is (cr(t)) ~ t 1-/Xp , which is 
proportional to ($p(t)) for large t, therefore the system S 
always “inherits” the relaxation properties of the perturbation. 
If 2 < fi s < Pp, the asymptotic dominant term is {cr(t)) ~ 
t l ~^ s , which is proportional to the ordinary unperturbed 
relaxation to equilibrium (§s( 0 )- 


Nonstationary case: 1 < Ms < 2. 


Let us make the assumption that, although at time t = 0 
half of the S elements of the Gibbs ensemble are in the 
state = +1 and half in the state §$ = — 1 , all of them 
are at the beginning of their sojourn in the corresponding 
states. This is an out-of-equilibrium condition, corresponding 
to preparing the system at t = 0. The calculations are detailed 
in Appendix B. Using the dynamic theory we obtain 


(cr(0) _ ki(ix s ,ttp) k 2 (ji s ,^p) 




t^p 


pip + l-tis ’ 


(41) 


where the first coefficient is given by 

T P ‘ XF ~ l sin(7r/^s)r(2 - fi s ) r(l - ii P + (jl s ) 




7 tT (3 — /ip) 


(42) 


and the second coefficient is determined to be 


k2(iis,tip) 


[sin(7r/x t s)T(3 - /x^)T( 1 - \i P + ii s )T(2lis - 3) - (2 - il s )T(2il s - tip - 1)] 
T P l ~^ p Ts fls ~ 2 Tt(iip — 2)(2 — /xs)T(3 — fisWifis ~ PpWQps ~ 3) 


(43) 


In this range of parameters the dominant term is always the 
first term in Eq. (41) which, if /ip < /is , is also slower than 
the unperturbed relaxation to equilibrium of In the latter 
range, therefore, the system S relaxes to equilibrium inheriting 
the same properties of the perturbation P . 

The result in Eq. (41) is of special interest since it discrimi¬ 
nates between the two approaches in the nonstationary regime. 
It is this difference that allowed to determine that liquid crystals 
[13] follow the prediction of the dynamical approach. In fact, 


the phenomenological approach disregards the influence of the 
perturbation on the occurrence time of the S events [19,28], 
while the dynamical theory does not, thereby affording a 
criterion for information transport that we judge to be a more 
appropriate representation of the communication among com¬ 
plex systems with /i < 2. However, the equivalence between 
the phenomenological and the dynamic theories in the case 
when the system S is infinitely aged (i.e., for /is > 2) indicates 
that the generalization of FDT given by Eq. (9), namely, the 
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dynamical theory, becomes active only when the system S 
is in a far from equilibrium condition and begins drifting 
toward equilibrium. Although equilibrium is never reached 
when (i < 2 , the correlation function tends to recover 

the property 4^(7 ,t') = ^ s (t — t') that makes the phenomeno¬ 
logical theory formally equivalent to the dynamical theory. 

B. Input-output correlation function 

Herein we study the asymptotic limit of the input-output 
correlation function 


Ooo = lim C(t)/e, (44) 

t —> oo 

within the dynamical approach. The input-output correlation 
function is again defined by the average over the fluctuations 
in both the system S and perturbation P 

C(t) = (£ s (t)£p(0)- (45) 

The asymptotic limit of C(t) is independent of the way the 
system and the perturbation are prepared, so we can use the 
prescription leading to Eq. (3) obtained assuming that both S 
and P are prepared at time t = 0. We therefore use the same 
arguments as those yielding Eq. (28) and, adapting them to the 
dynamic theory, we obtain 

O(t) = C(t)/€ = f dt'\l/ s (t,t')^ P (t,t'). (46) 

Jo 

Nonstationary case I: /is < 2, tip < 2. 

We use Eq. (46) with the general expressions for x//s(t,t') 
and typ(t,t') as obtained through Eqs. (10) and (12), re¬ 
spectively. In this case, taking the limit t —> oo yields (see 
Appendix B for details) 


with Op(t) given by Eq. (48). For /i s > 2 a finite mean time 
xs of exists, therefore a finite time scale tc oc xs exists 
such that for t > tc Eq. (37) can be used again to approximate 
tys(t,t')- With such substitutions, the expression for 
the correlation function becomes asymptotically equal to the 
correlation obtained in the phenomenological approach in the 
same regime [i.e., Eq. (30), leading to the final result Eq. (31)]. 

Stationary case II: /is > 2, tip < 2. 


In this case again a finite time scale tc can be found such 
that the approximation in Eq. (37) is valid. Thus, again Eq. 
(46) can be rewritten 


O(t) 22 f dt'\l/ s (t,t')^ P (t,t') 
Jo 


+ 


r , 

d ~ 

/ dt’ 


't c 

dt' J 


'Mm')- 


(50) 


This case is therefore equivalent to what is obtained for 
the phenomenological case in the same range of parameters. 
Again, in the asymptotic limit t —> oo, the first term in Eq. (50) 
vanishes and after integrating the second term by parts, one 
obtains 


O(t) 22 TO(0)Op(t,0 — ^s(t ~ A:)4V(Mc) 
lt . ~ . d^p(t,t') 

dt'v s (t-t'y 

tc 


f 


dt 1 


(51) 


In the asymptotic limit the second term on the right-hand 
side trivially vanishes and also the third term side can be shown 
to vanish (see Appendix B). The only remaining contribution 
in Eq. (51) is given by the first term, which is exactly one. It 
follows 


= lim 0(0 = 

t—>oo 

x F[{/i P - 


sinn/ip r(/ip +/is — 1 ) 

IX {tip - 1 )T(/ip + 1 )V(ti S - 1 ) 

1 ,dp — 1 ,dp + ds ~ 1 },{t l P,t l P + 1 }> 1 ] • 


(47) 


Nonstationary case II: /is < 2, tip > 2. 

In this case we can assume that, when the interaction is 
turned on, the perturbation P has already reached a stationary 
condition, so that in Eq. (46) x/rs(t,t') is given again by Eq. (10) 
but Op(eE) = Op(t — t'), with 

Vp(t) = (1 + t/Tpp p ~ 2 , (48) 

and this expression has to be directly inserted into Eq. (46). 

The power-law index /i P — 2 reflect the stationary condi¬ 
tion realized with the preparation of the perturbation P at a 
time t P = —oo. In this case, we obtain 


lim 0(0 = 0 . (49) 

t^oo 

In the time asymptotic limit, the system S turns out to be 
independent of P in spite of the fact that at t = 0 we switch 
on the S-P interaction. 


Stationary case I: /is > 2, (ip > 2. 

We again use Eq. (46) and assume that, when the interaction 
is turned on, the perturbation P has already reached a 
stationary condition so that in Eq. (46) ^p(t,t') = O P (t — t') 


Ooo = lim C(t)/e = 1. (52) 

t—>oo 

Let us make here a few remarks on these results. Of course 
in the absence of coupling, 6 = 0 , which means that C(0 
always vanishes. We switch the interaction on at t = 0. Thus, 
we always have the zero cross-correlation initial condition 
C( 0) = 0. 

As an effect of switching on the interaction at t = 0 we 
realize the condition C(t ) ^ 0, for t > 0. However, we find 
that there exist special conditions (region II in Fig. 2) for 
which the cross correlation again goes to zero, asymptotically 

Ooo = lim C(t)/€ = 0. (53) 

t^oo 

This indicates that only in such conditions, the system S , after 
a transient, recovers the condition of statistical independence 
of the perturbation § P (t ). The fluctuations §s with /is < 2 turn 
out to be statistically independent of %p(t) only when /ip > 2 . 
The environmental perturbation with /ip <2, on the other 
hand, is characterized by the remarkable property of forcing 
the system S to respond, regardless of the value of /is. 

Table II and the plot in Fig. 2 summarize such results 
and show the same qualitative properties observed for the 
correlation function in the phenomenological case, with the 
correlation functions for the two approaches being identical 
in squares I, II, and IV. The condition of ideal 1// noise (i.e., 
li s = /ip = 2 ) marks the transition from a condition of zero 
to maximal correlation. 








241 


TABLE II. Summary of the asymptotic values of the cross¬ 
correlation function 0(0 in the dynamical case. 


Iisi Pp^ 1 < /ip ^ 2 2 < fip < 3 

1 <Ps^2 Oqo = £d(Ps,Pp) I = 0 II 

2<^<3 Ooo = 1_m Oqo = IV 

*See Eq. (47). 

V. FINITE RESPONSE AND 1/f RESONANCE 

Herein we proceed to demonstrate that the intensity of 
the response a{t) to a single realization of the stimulus does 
not decay if ^ 0. This demonstration, therefore, is valid 
for both the dynamical and the phenomenological approach, 
leading to a general result. Let us define with p t (^ l s ) the 
probability that at time t the variable takes the value 

i = =bl and with p t (£ l s \%p) the conditional probability for 
the occurrence, at time t, of a value t= s = i = ± 1 , given 
the occurrence of a value = j = ± 1 . By definition, the 
nonvanishing yields 

C(t) = J2 ij Pt{Ss\H) Pt(H) -»• e<I> oo- (54) 

i,j=± 1 

We note that for t —> oo, on a time scale such that (§p(0)» 
which decays as (§>( 0 ))£ 1_/Xp , is a second-order quantity 
(9(£ 2 ) we have that 


Pt{Hp) = 1/2 + 0(s 2 ), 

(55) 

and 


4>(r) = <J>oo + 0 (e 2 ). 

(56) 

Thus is due to Eq. (54) and to the symmetry of the considered 
dichotomous processes 

Pt(?s\H) ->• 2 + l J £ 2 ■ 

(57) 



FIG. 2. (Color online) The asymptotic limit of O(f) is displayed 
for the different regimes of parameters for 1 < p s < 3 and 1 < p P < 
3, in the dynamical approach. 


In the same long-time scale, Eq. (57) yields 

K0>± ^ f - ±£< 1 > c», (58) 

i 

where the subscript ± indicates the value of at time t. 
Summing Eq. (58) over the two values of gives a total 
average null response, as expected. But if the magnitude \cr(t)\ 
of the response to a single instance of the input %p(t) is 
considered instead, its total average is 

(KOI) = \ £<K0|> ± £ \ L K ff (0>±l - e®co, (59) 

db db 

where an equality holds if terms of order 0(s 2 ) are neglected. 
Thus when > 0, the response cr(t) to a single instance of 
the input %p{t) does not die out and remains proportional to 
the stimulus intensity , no matter how large t becomes. Square 
III in both Figs. 1 and 2 is the plateau region of maximal cross 
correlation and response. Claims regarding the death of linear 
response , as in Refs. [19-28], are therefore appropriate only 
in relation to the vanishing correlation of square II. The total 
average response (<r(7)) always tends to vanish for t —>► oo for 
reasons that do not imply a lack of response except in the case 
of square II. 

The reason for the striking difference between the response 
to a harmonic perturbation and the response to a nonergodic 
stimulus is intimately related to the emergence of 1 // noise 
and to its spectrum described by Eq. (17), which assigns the 
weight S(f)/L = l/(/L) 3-/Xp to the spectral component of 
frequency / of a nonergodic stimulus. As a consequence, the 
stimulus generates, in time, lower and lower frequencies / so 
as to keep 1 /(/L) 2-/Xs (i.e., the response intensity to frequency 
/ [18]) finite, thereby yielding Eq. (59). The death of linear 
response [19-28] is caused by the fact that stimuli with fixed 
frequencies cannot cope with the decreasing frequency of the 
cascade of events of Eq. (20). 

We have afforded a compelling proof that the intensity of the 
single realizations of cr(t), with ps < 2 , does not decay if the 
perturbation (t) falls in the same complexity basin (pp < 2 ). 
This is the phenomenon of complexity management which 
allows to define the right stimulus to obtain a response from 
a system or network producing nonergodic fluctuations. Now 
we argue that 1 // stimuli generate the maximum information 
transport by looking at the mutual information 

m = J2p>^p)p>^M)^g[p,yM)/p,y s )i c 6 o> 

cj 

Using Eqs. (55), (56), and (57) it follows that 

I(t oo) ~ e 2 <J>^, (61) 

and the information transmission rate is obtained by multiply¬ 
ing I(t) by the input rate [37], given by Rp(t). If tip > 2,Fig. 1 
shows that < 1. Although square III in Figs. 1 and 2, indi¬ 
cates that all stimuli with p P ^ 2 induce maximal correlation, 
lip < 2 corresponds to a stimulus with decaying events rate 
(input bits/s) Rp(t). So even if a response is produced in this 
regime, the rate of information vanishes in time. 

Only at the crucial condition /jl p = 2, of ideal 1 // noise, 
does this algebraic decay become logarithmic, and, conse¬ 
quently, a steady and maximal information transmission rate is 
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achieved. This is the phenomenon that we call 1 // resonance. 
The above considerations are valid for both the dynamical 
and phenomenological approaches, therefore we consider the 
condition of maximal information transmission achieved in 
the ideal 1//-noise condition, a fundamental property of 
1 //-noise renewal processes. 

VI. CONCLUDING REMARKS 

The growing interest for the dynamics of complex networks 
is shifting the attention of the researchers from the synchro¬ 
nization of two stochastic units [38] to the synchronization of 
a large number of units [39], an interesting phenomenon that 
is closely related to the very popular model of Kuramoto [40]. 
The single units of the processes of chaos synchronization are 
chaotic and do surprisingly synchronize while maintaining 
the erratic dynamics that they have in isolation. Although 
the single units of the Kuramoto model are regular, it is 
becoming increasingly evident that the emergence of a global 
synchronization is a condition independent of whether the 
single units are regular or stochastic. The single units of the 
work of Refs. [1 1,41] are Poisson processes and if one of them 
drove the other, they would obey the principle of aperiodic 
stochastic resonance [42]. If the two units are bidirectionally 
coupled they are expected to undergo a condition of perfect 
synchronization if the coupling is sufficiently intense. When 
the number of interacting units is very large a phase transition 
occurs from the noncooperative to the cooperative behavior 
[1 1,41]. It is important to stress that at criticality no permanent 
consensus is reached, and the mean value of the global field 
vanishes. Yet, this condition is strikingly different from the 
noncooperative condition. The whole network remains in the 
“yes” (“no”) state for an extended time before making a 
transition to the “no” (“yes”) state. 

It is surprising that the phase-transition literature seems to 
have overlooked, with only a few exceptions [11,41,43], that 
the transitions from the “yes” (“no”) to the “no” (“yes”) state 
occurring at criticality are the “crucial” events defined in Sec. I. 
In other words, the time interval between two consecutive 
transitions is derived from a PDF that has the asymptotic time 
structure of Eq. (1) with a power index /x fitting the inequality 
condition of Eq. (2). Some authors [1 1,41] argued that /x = 1.5 
and others, [7], releasing the condition that all the units share 
the same Poisson rate, generate a global condition with crucial 
events characterized by /x < 2, but significantly departing from 
the value /x = 1.5. Note that the theoretical arguments of 
Ref. [44], yield the misleading impression that the crucial 
value of /x is a consequence of ordinary statistical physics. 

According to some authors [45,46] the Kuramoto phe¬ 
nomenon can be defined as inner synchronization. A network 
of cooperating units located on the nodes of a complex network 
may reach inner synchronization with different values of the 
control parameter, depending on the network topology [47]. 
This is a subject of increasing interest with attractive appli¬ 
cations to the dynamics of the human brain [45]. If we adopt 
this perspective, we can address the problem of information 
transmission from one to another complex network as a process 
of outer synchronization. This is an interesting issue, but the 
conditions to fulfill to realize outer synchronization are not yet 
clear [45]. 


An important result of this article is the discovery of a 
promising road to settle the problem of information trans¬ 
mission from one to another complex network. In fact, if the 
inner synchronization corresponds to a criticality condition 
and criticality generates crucial events with a power-law index 
/x fulfilling the inequality of Eq. (2), then a complex network 
at criticality is a generator of 1 // noise, with S(f) oc 1 // 3-/x . 
Thus the problem of information transmission from one to 
another complex network, in this regime, becomes equivalent 
to the phenomenon of 1 // resonance illustrated in this article. 
This is essentially the main result of earlier work [29]. The 
main conclusions of Ref. [29] are illustrated by Fig. 1, which 
is obtained using the phenomenological LRT. 

What are the limits of this earlier result? The experiments 
[12,13] yield support to the dynamical rather than the phe¬ 
nomenological LRT, thereby generating doubt that the results 
of Ref. [27], although very attractive, may not completely 
reflect reality. It is important to stress that phenomenological 
LRT is a natural consequence of adopting the asymptotic 
time perspective replacing the waiting-times PDF //r) of 
Eq. (1) with i/r(r) oc l/r^. This way of proceeding, although 
generating the elegant mathematics of fractional derivatives, 
has as an ultimate effect the misleading discovery of the death 
of linear response [19-24,28]. We do not adopt the asymptotic 
time perspective but the special form of Eq. (1). This is not 
a unique way of connecting the long-time to the short-time 
regime. However, whatever form we adopt we are convinced 
that there will be a parameter playing the same microscope 
role of the parameter T of Eq. (1). It is reasonable to assume 
that an external perturbation may perturb either T or /x, or both 
of them. However, the perturbation of /x is incompatible with 
the assumption of a weak stimulus. In fact, /x is a consequence 
of the cooperation among the units of the network, and a 
perturbation may affect /x only if its strength is large enough 
to influence the interaction among the units of the network. 
Thus, an external weak perturbation can only have an effect on 
T, thereby making the dynamical LRT become the proper way 
to study the response of a complex network to a weak external 
stimulus, in accordance with the experimental results [12,13]. 

For these reasons, we can conclude that Fig. 2 is the original, 
and important, result of this paper. We hope that it may open the 
road to the dynamical solution of the problem of information 
transmission [48] from one to another complex network, a 
research topic that is still in its infancy. 
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APPENDIX A 

In this Appendix we record more details about the deriva¬ 
tion of both the average response (cr(t)) and the input-output 
correlation function 0(0 in the case of the phenomenological 
approach. 

When we adopt the phenomenological theory, we obtain 
for the average response to external perturbation, in the 
nonstationary case [cf. Eq. (27)] the following asymptotic 
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expression: 


(a(0) ki(fis^p) k 2 ilis,dp) 


£ fPp-l t ^ S ~ 1 

where the coefficient of the first term is 

Ti/i P — /is) r(l — /ip +/is) 
k\ (as,Up) = -— -, 

r ( i -iipWQis) r(2-ii P )r(nsy 

and the coefficient of the second term 

r(fi P — fis) 


(Al) 


(A2) 


k 2 iiis,t^p) = 


r(i — M'sW(M'p) 


(A3) 


Note the logarithmic corrections corresponding to /is = Pp 
with 


(a(0> _ sin (7 T/ 1 S ) lo gt A(ns) 




71 


t^s 


-1 


t^s 


-1 


(A4) 


and 

Mps) = 




n 


1 


_PS ~ 1 


-2y - 2Jr L ip s ) 


- COS(7 Ifls), 

(A5) 


where y is the Euler’s constant and Jr Liz) the logarithm 
derivative of the T function. These predictions are qualitatively 
equivalent to the dynamical theory predictions with the 
assumption that the system S has been prepared in the very 
distant past. 

The general expression of the correlation function in the 
phenomenological approach is given by Eq. (28). 

Nonstationary case I: /is <2 ,/ip <2. 

In this range the asymptotic approximation for the function 
R(t ) defined in Eq. (11) for both S and P is 


SiriTTLl „ 9 

R(t) ~ — -e~ 2 . 

71 


(A 6 ) 


The power-law properties of 40 (0, in the long-time limit, 
shift the dominant contribution to the integral of Eq. (28) to 
the range t' ~ t, this allows to adopt the approximation (A 6 ) 
inside the integral. It follows 

sin 7 r u c sin 7 r a p 

0(0 ~ ^ ^ 


TV 


TV 


X 


f ~ r)dx f x^ p - 2 ^ P (t - x)dx 

Jo Jo 


(A7) 

Using a generalized Newton binomial expansion of the power- 
law form of the functions 4V(0 we obtain the following 
expression: 

oo 


0(0 — 


SinTT/As sm7t/jip 


71 


71 


E 

n= 0 


/ 



1 (- 1 )" 

. n ) 

' (t + Tp) n +I^~ x 

t')dt ', 

(A 8 ) 


while the convolution with 40 (0 in the integral, leads to 


0(0 — 


SinTT/ls SUlTT/lp 


x 


7X 7X 

j.n+ijLp-1 

n + /ip — 1 


oo 


E 

n= 0 


1 — PP 
n 


(-0 


n 


it + T P ) n+ ^p 


-l 


Tin + tip + p s ~ 2)T(2 - p s ) 


T{n + tip) 


(A9) 


where we have used the fact that for t large the main 
contribution to the integral comes from the range of values 
x < r < t and therefore the asymptotic approximations for 
Rsit) an Rpit) are justified. The limit for t -> oo cancels 
the time dependence and leaves a sum which leads to the 
expression of Eq. (47) in the text. 

Nonstationary case II: /is < 2,/ap > 2. 

In this case the function Rpit) tends asymptotically to a 
constant value and therefore it easy to see, following analogous 
procedure to the previous case, that the correlation tends to 
zero. 


Stationary case I: /is > 2,/zp < 2. 


As mentioned in the text, from Eq. (30) it is enough to show 
that the third term on the right-hand side is zero. Such terms 
read as 

t 

4> s (t - t')dt't' ,Pp - 2 ^ P {i - t'y (A 10) 


sin 7 x /ip 


71 


l 


Using a generalized binomial expansion for both 4-0(0 and 
40(0 we get 


0(0 ^ 


sin 7T /Ip 


71 


E 

n,m 
m 


1 — l^p\ /I — ds 


(-D 


n 


n 


m 


(t + Tp) n+ > ip ~ i 


X 


(- 1 ) 


if + T s r+^~ 2 j 0 


f 


,n+m+nr-2 d ' 


(All) 


Carrying out the integration shows that the leading term in t 
vanishes for t —> oo Such a demonstration is valid also for the 
dynamical case in the same range of parameters [see Eqs. (51) 
and (52)]. 


APPENDIX B 


This Appendix is devoted to detailed calculations involved 
the evaluation of the asymptotic limits of the function 0(0 
and (cr( 0 ) in the dynamic approach. 

Herein only the parameter range /is < 2 ,/i P < 2 is consid¬ 
ered, the other ranges being analyzed in detail in the main text 
and in Appendix A. 

0(0 is given by the following integral: 

4>(0 = l 1r s (t,t')Vpit 9 t')dt', (Bl) 

Jo 

where Jrsitp') is defined in Eq. (10) and 4O(M0 in Eq. (12), 
each with the appropriate labeling. 0(0 can be decomposed 
as sum of four contributions 

4 

<D(0 = y>«V), (B2) 

1 = 1 

where 


Oi(0 = tx/fsit^pit), (B3) 


O 2 (0 = Vo(0 


= Jfsit) 



Rp(t")V P (t -t")dt"dt' 
t")R P it")Vpit - t")dt\ 
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O 3 (0 = V P (t) [ f Rs(t")1r s (t ~ 

Jo Jo 


l//\ jJf j+r 


f' 


= V P (t) / (t - t”)Rs(nir s (t - t")dt", (B5) 
f o 


m t' 

Rs(t")^ s it - t")R P {r)^p{t - x)dt'dt"dx. 

_ 

(B 6 ) 

Here J/s(0 and P (t ) are defined in Eqs. (13) and (36). Using 
the methods of Ref. [49], Eqs. (B4), (B5) can be evaluated in 
the asymptotic limit to yield 


Oi(0 




^Ps+Pp 


Ci C 2 C3 

, 4>2(0«t^zt. $ 3(0- 


-2 




t^s- 


t^p 


-1 


Since in the limit t 00 the three contribute vanish, it is 
not important the evaluation of the constants c\, c 2 , c 2 . To 
evaluate O 4 (t) we first need an analytical expression for Rs(t) 
and Rp(t). It can be shown that [49] 


Rs(t) ~ — 


sm(n/jLs) 1 


7TT£ S 1 


with the same expression, with the respective parameters being 
valid for Rp(t). Then we have 
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r (t + Ts — 1 0 1 fZs t /fZs 1 sin(7t/jLs) 
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On the other hand 
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Using for Rp(t) the same approximation of Rs(t ) Eq. (B2). 
we have 
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Rp(x)^p(t — x)dx 
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(B7) The function <£>4 (t) is 


OO 


sin it up sin tills t^ s+llp 1 / 

*4(0« ^ E - 


t 


n 


7T 2 (t + TpY P 1 {t + T S Y' 


n= 0 


t + Tp 


x 


1 - /I P \ F(n +p P +Ps~ l,Us ~ l,n+ p P + Hs,-ppp s ) 


n / (n + p P - l)(n + p P + ii s - 1) 

where F(a,b,c,z) is the hypergeometric function. Finally 

sin Tt /i p T (/i p + /x,s — 1 )(£ + Tp) 1- ^^ 


O 4 (0 




(1 - /xp)T(/xp + l)T(/xs - l)(t + T s Y s t l *p 


x F 


{lip — l,lip — 1 ,/xp ~\~ iis ~ 1},{ Up,Up + 1}, 


* + r- 




where ^ ({<^}, {Z?} ,z) is the generalized hypergeometric func¬ 

tion. In the limit for t -> oo 

sin 7 r fi p T (/i p + /is — 1 ) 

°°° = _ 7r(/xp - l)T(/xp + l)r(/x 5 - 1 ) 

x F [{/xp - l,/xp-l,/xp+/xs - 1},{/xp,/xp + 1},1] . 

(BIO) 

As far as the evaluation of the function (cr(f)) is concerned, 
in the dynamic approach it is given by Eq. (35) 

(a(t))=e[ 1r s (t,t')Vp(t')dt', (Bll) 

Jo 

which is to be evaluated in the parameter range /is < 2,/i P < 
2. Let Ps(t ) = Rs(t) + S(t), Eq. (Bll) reads 

{cr(t)) = € f dt'^p(t') f dxPs(x)Ys(t — x), (B12) 
Jo Jo 


which, after a transformation on the integration domain, turns 
into 


= € f dxfs(t - x)P s (x ) 

Jo 

The long-time limit shifts the main weight in the integral on the 
terms such that x < t. Therefore the approximation Eq. (B7), 
valid also for P s (t ) in the long-time limit, can be adopted. 
Inserting such approximation in Eq. (B13) leads to 
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dt'V P (t'). (B13) 


sinn/is . 

(cr(t)) —/ dxYs(J ~ x)x^ s / dt ^pit ). 
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After getting rid of the integral in t’ by direct integration 
of W P (t'), one is left with a simple convolution which is 
easy to analyze in Laplace transform. After some straight¬ 
forward algebra, extracting the two asymptotic leading terms, 


leads to 

(a(t)) ~ ki+ k 2 C s ~ tlp ~ l , (B15) 

that is, the asymptotic expression of Eq. (41), with the 
coefficients given by Eqs. (42) and (43). 
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Robust Control for Mobility and 
Wireless Communication in 
Cyber-Physical Systems With 
Application to Robot Teams 

Control of an autonomous robot team with the aim of optimizing network performance 
is discussed in this paper; the authors design a controller to ensure availability 
of communication resources. 

By Jonathan Fink, Student Member IEEE , Alejandro Ribeiro, Member IEEE, and 
Vijay Kumar, Fellow IEEE 


ABSTRACT | In this paper, a system architecture to provide 
end-to-end network connectivity for autonomous teams of 
robots is discussed. The core of the proposed system is a cyber¬ 
physical controller whose goal is to ensure network connec¬ 
tivity as robots move to accomplish their assigned tasks. Due to 
channel quality uncertainties inherent to wireless propagation, 
we adopt a stochastic model where achievable rates are 
modeled as random variables. The cyber component of the 
controller determines routing variables that maximize the 
probability of having a connected network for given positions. 
The physical component determines feasible robot trajectories 
that are restricted to safe configurations which ensure these 
probabilities stay above a minimum reliability level. Local 
trajectory planning algorithms are proposed for simple envir¬ 
onments and leveraged to obtain global planning algorithms to 
handle complex surroundings. The resulting integrated con¬ 
trollers are robust in that end-to-end communication survives 
with high probability even if individual point-to-point links are 
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likely to fail with significant probability. Experiments demon¬ 
strate that the global planning algorithm succeeds in navigating 
a complex environment while ensuring that end-to-end com¬ 
munication rates meet or exceed prescribed values within a 
target failure tolerance. 

KEYWORDS | Mobile ad hoc networks; motion planning; multi¬ 
robot systems 

I. INTRODUCTION 

The confluence of advances in wireless communication, 
sensing, computing, and control has led to the emergence 
of cyber-physical systems. Cyber-physical systems go be¬ 
yond the mere addition of cyber functionalities to physical 
systems being typified by a tight blending of cyber and 
physical components and functions. A canonical example 
of this class of systems is the autonomous robot team 
where a group of mobile robots cooperate to accomplish 
tasks assigned by human operators. Consider, for example, 
a search and rescue mission in a hazardous environment 
where a team of robots is deployed to scout points of in¬ 
terest. Designated lead members of the team move to 
specified locations while the remaining robots provide 
mission support by configuring a multihop wireless net¬ 
work that permits relaying of information back to the 
operators. Availability of wireless communications is 
critical for task accomplishment because communications 
are required to exchange information between robots as 
well as to relay information to and from the human 
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Fig. 1. System architecture. Task specification here represents a generic spatial application defined by a convex task potential function 'P(x) 
while providing a stream of data to the human operator. Individual robot components consist of the low-level robot control, estimation, 
and communication. We additionally assume that a subsystem is available to build an online model of radio communication in the environment. 
The focus of this work is on developing concurrent methods for routing and mobility control. 


operators. It then follows that deployment of the team 
requires integration of (cyber) wireless communications 
with (physical) mobility control. The cyber-physical 
perspective posits that this integration is not just a matter 
of adding wireless transceivers to the robots but requires a 
tight conjoining of communications and mobility control. 

Tight integration of communications and mobility 
arises in autonomous robot teams because availability of 
wireless infrastructure is unlikely in the harsh environ¬ 
ments in which they are to be deployed. In the absence of 
infrastructure, robots have to self-organize into a wireless 
network capable of supporting the necessary information 
exchanges. In this context, the spatial placement of the 
robots affects the feasibility of successful communication 
because some physical configurations cannot support re¬ 
quired transmission rates between, say, human operators 
and some lead members. We can therefore think of com¬ 
munications as determining a set of virtual obstacles that 
restrict the movement of the robots. However, this is not a 
hard limit as it is possible to reconfigure communication 
variables to alter the set of physical configurations for 
which communications are feasible. This yields a control 
loop that begins with communication variables limiting 
mobility and finishes with mobility determining commu¬ 
nication variables. In the end, successful deployment of an 
autonomous team of robots requires joint cyber-physical 
controllers that determine (physical) trajectories for the 
robots while ensuring (cyber) availability of communica¬ 
tion resources [1]. 

An architecture diagram for the cyber-physical control 
of autonomous teams of robots is shown in Fig. 1. As with 
any mobility control system, there is a block performing 


task specification, a second block executing the control 
law, and a third block conducting actuation and state esti¬ 
mation. The task specification block interfaces with the 
human operators and integrates robot observations and 
requirements to determine specifications that it passes on 
to the control block. These specifications come in the form 
of a potential function \F(x) and basal end-to-end commu¬ 
nication rate requirements a^ min —the subindex i here de¬ 
notes a source and the superindex k a destination; see 
Section II. The goal of the autonomous team is to minimize 
the potential \F(x) while ensuring that end-to-end com¬ 
munication rates exceed the minimum required level a^ min . 

The control block is the core of the cyber-physical 
system and the main focus of the work described in this 
paper. The purpose of this block is to determine physical 
control inputs x(t) that dictate the movement of the robots 
and network variables ct(t) that determine the communi¬ 
cation of information through members of the team. The 
control inputs x(t) are conducive to task completion, i.e., 
minimization of the task potential, while ensuring reliable 
network operation, i.e., communication at or exceeding 
rates a^ min when combined with network variables ct(t). 
Information about achievable rates between different loca¬ 
tions is necessary to determine mobility and communi¬ 
cation variables. This information is provided by a radio 
communication modeling block. Due to the difficulty of 
predicting these rates accurately, the output of this block is 
in the form of a statistical model of achievable rates. As we 
discuss in Section III, models that provide the mean and 
variance of achievable rates are reasonable in practice. 

The execution block consists of individual robots 
implementing the control law x(t) and routing packets 
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according to variables cx(t). Robots also take observations 
yi(t), e.g., a video feed, that they relay to task planning and 
perform position estimation X;(t) that they feedback to the 
control block. Using available technologies for mapping, 
control, and state estimation, each robot can estimate its 
position Xi(t) and control its velocity X{(t ) with respect to a 
common known map of the environment. Since we do not 
provide details about mapping or state estimation in this 
paper, we refer the interested reader to, e.g., [2]. 

A. Communication-Aware Mobility Control 

In the design of concurrent communications and mobi¬ 
lity control algorithms for autonomous robot teams there 
are three issues that arise naturally: 1) how to translate 
robot positions into information about the point-to-point 
rates at which pairs of robots can communicate with each 
other; 2) how will the cyber-physical control loop in Fig. 1 
utilize point-to-point connectivity information in order to 
ensure end-to-end network integrity; and 3) how to deal 
with uncertainty in the information about point-to-point 
channels between individual pairs of robots. 

Issues 1) and 2) are related to each other. A simple 
answer to issue 1) is to identify the ability to communicate 
with spatial proximity, possibly mixed with line-of-sight 
information to account for physical obstacles to radio pro¬ 
pagation. Using this model, point-to-point links between 
robots are feasible if they are sufficiently close and within 
sight of each other and infeasible otherwise. Matched to 
this simple model of point-to-point connectivity, the abi¬ 
lity to establish end-to-end connections is identified with 
the connectedness of the resulting graph. The problem of 
ensuring end-to-end network integrity then becomes one 
of letting robots move to accomplish their task while 
guaranteeing that the graph of point-to-point connections 
consists of a single connected component [3]-[10]. 

The advantage of this approach is that network-wide 
indicators of connectedness such as the second largest 
eigenvalue of the graph Laplacian or the ^-connectivity of 
the network’s graph are computationally tractable. It is 
therefore not difficult to insert connectivity constraints 
into the task planner and, if there is leeway in the selection 
of different spatial configurations, to find configurations 
that optimize connectivity in terms of the metric under 
consideration. Do notice that path planning is computa¬ 
tionally challenging even in the absence of communication 
constraints. The tractability of graph connectedness indi¬ 
cators simply means that the computational complexity of 
the cyber-physical control loop is comparable to that of a 
communication-unaware controller. 

Aside from the value of simplicity, associating small 
distance and line-of-sight with connectivity is not entirely 
justified. Early solutions to wireless networking for sys¬ 
tems of dynamic nodes were indeed based on the concept 
that reliable point-to-point links could be detected and 
established [11], [12]. However, it was quickly noticed that 
binary channel models were not accurate representations 


of wireless links [13]. As a consequence, attention was 
devoted to methods that explicitly consider the reliability 
of point-to-point links and choose routes to optimize end- 
to-end reliability metrics such as probability of delivery 
[14]-[16]. These more nuanced models of wireless connec¬ 
tivity can be incorporated without much difficulty into 
mobility control algorithms that use indicators of graph 
connectedness [17], [18]. If links are characterized by 
reliability indicators, the binary connectivity graph can be 
replaced by a weighted graph. Information about the 
number of connected components can be similarly ex¬ 
tracted from, for example, the second largest eigenvalue of 
the weighted graph’s Laplacian. 

Either based on binary or weighted graphs, connected¬ 
ness is an indirect indicator of the ability to establish end- 
to-end communications. Having a single connected 
component just implies that multihop paths from sources 
to destinations exist but does not determine whether the 
network formed by the robots is able to support desired 
communication rates. It is possible that some links in the 
connectivity graph can support rates that are too low, or 
that the resulting configuration contains bottlenecks that 
limit information flow. In the cyber-physical control loop 
depicted in Fig. 1, graph connectedness does not solve for 
the network variables a,(t). Instead, it restricts movement 
so that a necessary but not sufficient condition for the 
existence of feasible network variables a,(t) is not violated. 
In fact, one could argue that these controllers are not 
cyber-physical because the integration of wireless commu¬ 
nications with physical mobility is minimal. A true cyber¬ 
physical design should consider end-to-end rates and 
conjoin the design of mobility and network control as done 
in the systems described in the following section. 

B. Robust Cyber-Physical Control 

An honest metric of network integrity has to rely on the 
achievability of target end-to-end communication rates. 
Communication rates, however, do not depend solely on 
the spatial configuration but also on the manner in which 
packets are routed through the network. This leads to the 
problem of joint cyber-physical control of trajectories and 
routing variables [1], [19]. For given spatial configuration x, 
pairs of achievable point-to-point communication rates 
R ij( x ) are assumed available at the radio modeling block. 
The cyber part of the controller queries the modeling block 
and attempts to determine routing variables a.(x) that 
support desired communication rates afy nin . If this attempt 
is unsuccessful, spatial configuration x is marked as un¬ 
allowable. If the attempt succeeds, joint configuration 
(x, ct(x)) is deemed feasible and, in particular, spatial 
configuration x is deemed allowable. The physical part of 
the controller utilizes the information on allowable con¬ 
figurations x to plan physical routes that achieve 
minimization of the task potential \F(x) while ensuring 
that end-to-end rates exceed basal requirements at all 
points in time. See Section II. 
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Ensuring continuous reliable communication is chal¬ 
lenging, however, because significant uncertainty in 
achievable rates is inherent to the deployment of an auto¬ 
nomous robot team. Due to shadowing and small scale 
fading, even small variations in robots’ positions lead to 
significant changes in channel strength [13], [20], [21] 
which translate into commensurable changes in achievable 
communication rates between pairs of robots. Precise 
channel state information can be acquired through mea¬ 
surements over time, but mobility planning algorithms 
necessitate access to channel quality indicators at future 
positions to which the robots are yet to be deployed. Thus, 
from the perspective of the system in Fig. 1, rates R i; -(x) are 
random. The outputs of the radio modeling block are rate 
estimates R i; -(x) whose variances R i; -(x) are typically large. 
See Section III. 

Since achievability of end-to-end rates cannot be gua¬ 
ranteed because of rate uncertainties, it is natural to rede¬ 
fine network survivability in terms of the probability of 
end-to-end rates exceeding their basal rates a^ min . Packet 
routing variables ct(x) are then determined in order to 
maximize this probability [1]. The resulting routes are 
robust because they are chosen to minimize the probability 
of failure. At this point it has to be noted that the goal of 
the self-organized network is to maintain reliable end-to- 
end communication, not point-to-point communication. 
Consequently, we can think of robust routes as exploiting 
spatial redundancy to minimize the effect of point-to-point 
uncertainty in end-to-end communication rates. Indeed, 
by splitting traffic flows between various neighboring ro¬ 
bots, we can ensure that while failure of a particular link 
may reduce end-to-end communication rates, it does not 
interrupt them completely. See Section IV. 

The integration of robust routing with the physical con¬ 
trol loop is discussed in Section V. We develop local plan¬ 
ning algorithms for simple environments (Section V-A) that 
we test in simulations for teams composed of three and four 
robots (Section V-B). These simulations demonstrate that 
the performance of local controllers is hindered by local 
minima. To overcome this problem as well as to handle 
complex environments, we leverage the local controllers to 
design global planning algorithms (Section V-C). Global 
planners are tested with experimental results in an indoor 
office space at the University of Pennsylvania (Section V-D). 
The experiments demonstrate that the global planning 
algorithm succeeds in navigating a complex environment 
while ensuring that end-to-end communication rates meet 
or exceed prescribed values within a target failure tolerance. 
We close the paper with concluding remarks (Section VI). 

II. COMMUNICATIONS AND MOBILITY 

Consider a team of N robots and denote their positions as 
Xi, for i = 1,... ,N. The robots are kinematic and fully 
controllable which allows us to consider simple mobility 
models of the form Xi(t ) = Uj(t), where Uj(t) is the control 


input to robot i. A human operator is located at the fixed 
operation center that we index as i = 0 at position xo- 
Further define vectors x := (xo,.. . ,Xn) G R 2 ^ n+1 ^ and 
x := (xo,... ,xn) £ R 2(N+1) to group all positions and 
velocities, respectively. The task assigned to the team is 
specified through a generic scalar convex task potential 
function \F : R 2 ( N+1 ) —» R. If the potential minimum \F m i n 
is attained at x*, i.e., if \F(x*) = \P m in, the configuration 
x* satisfies task completion. For example, if a designated 
leader agent i must visit a target location x^ goa i £ R , we 
can define \F(x) = \\x£ — :v, goa i|| 2 * The minimum \P m i n = 
0 is attained by any configuration x* = (xo, • • •,x^,..., 
Xn) for which X£ = x^ goa i, or equivalently by any member 
of the set x* G {x = (x 0 ,..., X£,... , x N ) : X£ = x^ goa i}. 
Irrespective of the particular form of \F(x), the control 
problem is to find velocities x(t) such that at some time tf 
the team configuration x(tf ) = x(0) + Jq x(t)dt satisfies 
task completion in that we have \F(x(tf)) = \F m i n . 
Mathematically, we can write this mobility control formu¬ 
lation as 


min 

subject to 



x(t) = x(0) + 


x(u) du. 


o 



As robots move to accomplish their task, they maintain 
end-to-end data communication flows between members of 
the team and/or members of the team and the operation 
center. Information flows are indexed as k = 1,.. ., K. 
Flows may have multiple sources and multiple destina¬ 
tions. The set of destinations of the feth information flow is 
denoted as dest(fe). For agent i and flow k, the variable 
a ! min represents the required communication rate between 
agent i and any of the agents in the set of destinations 
dest(fe). For example, if the only communications of inter¬ 
est are from the lead robot £ to the operation center, there 
are only K = 1 flows. Since the flow k = 1 is intended to 
the operating center, dest(l) = 0 and a^ min denotes the 
minimum level of service for the communication from the 
leader to the operating center. All other variables 

of,min = 0 are nul1 - 

We model point-to-point connectivity through a rate 
function R i; -(x) = Rij(xi,Xj ) that determines the amount of 
information that agent i at position Xj can send to agent j at 
position Xj. Since direct communication between the 
source and the destination of an information flow is not 
always possible, terminals self-organize into a multihop 
network to relay packets for each other. Packet relaying is 
determined by routing variables af- which describe the 
fraction of time node i spends transmitting data for flow k 
to node); see Fig. 2. Thus, the product a^Rij(x) determines 
the rate of point-to-point information transmission from 
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task specifications in (1) and (3). Reducing \F(x) as per (1) 
and ensuring network integrity as per (3) may be 
conflicting requirements. We therefore replace (1) by a 
concurrent search of trajectories x(t) and routes cx(t) so 
that the task potential is minimized without ever breaking 
communication connectivity. Mathematically, we write 
this objective as the optimization problem 


Fig. 2. Communication network. The nodes are deployed to support 
end-to-end rates from node i to destination (or flow) k. Routing 
variables aft determine the fraction of time node i sends packets to 
node j for flow k. Rij is defined as the supported rate of the wireless 
channel from node i to node j. 


i to j. If we consider the transmission to all neighboring 
terminals for which R i; -(x) > 0, the total rate at which pa¬ 
ckets leave agent i is af-Rij(x). Likewise, the total rate 
at which i receives packets from other terminals is 

The information rate af(cx,x) 
available for flow k at source i is the difference between 
outgoing and incoming rates 


N 


N 


af(oc,x) = J2 a fj Ri i( x ) ~ a ji R i‘( x ) ( 2 ) 


i=o 


j=0,jydest(fe) 

^ v__. . 


Outgoing packets 


Incoming packets 


where we define the vector a, grouping all routing 

variables aL Notice that the variables a\- represent time 

1 

slot shares and must therefore satisfy 0 < af- < 1 for all i, j , 
and k. It must also be that Y'. h o& < 1 for all i to ensure 
that the sum of all time shares at terminal i does not exceed 
1. It is possible to, alternatively, require Y2i j,k a tj < 1 if we 
wish for only one link to be active at any time across the 
entire network. 

Routing variables o t and configuration-dependent rates 
Rij(x) determine the set a? 1 (a., x) of end-to-end commu¬ 
nication rates from each node i and flow k as per (2). The 
task specification requires that end-to-end rates exceed 
the minimum threshold a^ min . Therefore, integrity of the 
communication network necessitates that 



(a, x) > cl- 

\ / l, 


min’ 


for all i, k. 



Notice that a| 2 (cv,, x) is a function of positions x and rout¬ 
ing variables o t. To control end-to-end connectivity, i.e., to 
satisfy (3), we can resort to control of positions x, to 
control of routes a,, or both. 

Since communication is necessary for task completion, 
the mobility control problem as summarized in (1) is 
redefined. The new goal is to find algorithms and control 
policies that govern robot motions in order to satisfy the 


min 

a(t),x(t),tG[0,t / ] 



subject to af(oc(t),x(t)) > a l fe min 




where rates a| 2 (a,(t),x(t)) are given by the expression 
in (2) with cx = a(t) and x = x(t). 

A drawback of the formulation in (4) is the difficulty of 
ensuring that the constraints in (3) are satisfied. As seen 
in (2), rates a| 2 (oq x) depend on the link reliabilities R i; -(x), 
which are difficult to estimate. Propagation knowledge is 
used by the radio communication modeling block in Fig. 1 
to provide rough rate estimates Rij(x). Agents can further 
measure received signal strength indicators (RSSIs) to 
refine the rate estimates for the current configuration x(t); 
see, e.g., [22]. The challenge here is that Rij(x) estimates 
are needed not only for configuration x(t), but for nearby 
configurations to which the robots may move. The high 
variability of wireless channels makes Ry(x(t)) a poor 
predictor of Rij(x) even if x is close to x(t). In general, 
channel estimates R i; -(x) at future positions x follow from 
radio propagation modeling and corrections from RSSI 
observations. 

In formal terms, uncertainty of channel estimates 
means that the variances R i; -(x) of channel estimates R i; -(x) 
are typically large for possible future positions x. 
Variances R i; -(x) can be provided by the radio modeling 
block and are therefore assumed to be available at the 
controller along with the mean estimates Rtj(x). We seek 
to redefine (4) in a manner that takes into account this 
probabilistic formulation of channel rates. The important 
observation here is that if point-to-point link rates become 
random, so do the rates a| 2 (a.,x) of end-to-end commu¬ 
nication flows [cf. (2)]. Consequently, it is not possible to 
guarantee satisfaction of the constraints in (3). Rather, we 
introduce a reliability tolerance e and require that for all i 
and k 


af(a, x) > af 


mm 


> 6. 



That is, we require that the end-to-end link between all 
sources i and the destinations of all corresponding flows k 
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exceed their minimum required level of service with pro¬ 
bability larger than e. As in the case of the rate require¬ 
ment in (3), we can satisfy (5) by controlling ol and x 
separately or jointly. 

In order to robustly satisfy the networking constraints 
in (5), the concurrent routing and mobility problem (4) is 
replaced by 


min 4/(x(tf)) 

a(t),x(t),te[0,t/] 


subject to 


4( a > x ) > a*i 


mm 
t 


> e 


x(t ) = x(0) + J x(u) du. 


o 


( 6 ) 


The focus of this paper is the solution of (6). 

The problem formulation in (6) inherits some standard 
complications from the control formulation in (1). The 
problem is infinite dimensional and due to, e.g., the 
presence of obstacles, not convex. The concurrent search 
in (6) is further complicated by the entanglement of the 
routing and mobility problems. We deal with this en¬ 
tanglement by fixing x and selecting a in a manner that 
optimizes the reliability P[a^(ct, x ) — a fmin] (Section IV). 
We finish with local and global searches on positions x 
to minimize 4/(x) while keeping reliabilities above the 
e threshold (Section V). 


signal strength Pr(xi, xj) that we cascade into models of the 
packet error rate and the supported communication rate 

Rij(xt,Xj). 

Received signal power Pn(xi,Xj) is determined by three 
phenomena: path loss due to the distance from the source, 
shadowing due to obstacles in the propagation path, and 
multipath fading that arises as a result of reflections and 
refractions. Of these three phenomena, path loss and sha¬ 
dowing can be incorporated into a predictive model with 
relative ease. Fading, however, is difficult to predetermine. 
Multipath fading arises due to a spatial wave interference 
pattern generated by reflections and refractions of the 
electromagnetic wave which, as a consequence, inherits a 
space constant in the order of the wave’s wavelength. 
Thus, in order to use ray tracing models to predict the 
interference pattern we need to locate all potential 
reflectors and refractors with precision smaller than this 
wavelength. While plausible in principle, this is intractable 
in practice because wavelengths at common operating fre¬ 
quencies are too small—for example, with radios operating 
at 2.4 GHz., the wavelength is about 12 cm. Given this 
difficulty we adopt the following model for the received 
power Pr, dBm (x*, Xj) = 10 log(PR(xi, *j)) measured in dBm: 


PjydBm (v i Xj ) 



10n * log(||x; — Xj 

V 

Path loss 



Shadowing 



F ading 



III. POINT-TO-POINT RATE MODELING 

With robots i and j located at positions Xj and Xj, we seek to 
develop a probabilistic model for the supported communi¬ 
cation rate Rij(x) = Rfj(xi,Xj ) between them. More specif¬ 
ically, we wish to model its expected value Rtj(x) = 
Rij(xi,Xj ) and variance R;j(x) = Rij(xi,xj). Our intent is to 
use simple radios that do not perform rate or power 
adaptation—we use 2.4-GHz Zigbee radios in our experi¬ 
ments; see Section V-D. In that case, the communication 
rate R^x^Xj) is a function of the packet error rate of the 
channel, which in turn is a function of the signal-to-noise 
ratio [23]. We therefore focus on models of the received 


where the term T is a zero-mean Gaussian random varia¬ 
ble with variance o 2 ^ modeling fading effects. The term L 0 
is the measured power at a reference distance do from the 
source, n is a path loss exponent, and W(xj,Xj) is a non¬ 
smooth function to model shadowing as a function of the 
number of obstacles between source and destination. 
Experimental data collected in an indoor office environ¬ 
ment at the University of Pennsylvania along with the 
function in (7) having parameters fit to the experimental 
data are depicted in Fig. 3. The parameters in Fig. 3 are 
L 0 = —51 dBm for do = 1 m, n = 2.1, W(xj,Xj) = 0 for 
line-of-sight links and W(xi,Xj) = 7.6 dB for non-line-of- 
sight links, and ajr = 32 dB 2 . Notice that fading can cause 
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Fig. 3. Experimental characterization of received power model (7) based on lOOO samples when we classify wireless channels based on 
line-of-sight. (a) Line-of-sight. (b) Non-line-of-sight. 
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variations in received power in the order of d=10 dB. This 
is verified by the standard deviation of JF, which is crjr ^ 
5.7 dB. Recall that a 10-dB difference corresponds to a 
change of an order of magnitude for the power measured 
in linear units. 

Translation of received signal strength P R (xi,Xj ) into 
packet error rate p e (P R (x(, Xj)) depends on the type of 
modulation and the choice of error correcting codes. 
Regardless of specifics, a generally good approximation to 
the packet error rate is 


the controller is agnostic to the particulars of the radio 
propagation model and relies only on the ability to query 
for means and variances. It is reasonable to expect that 
techniques for spatial mapping of point-to-point communi¬ 
cation rates will outperform distance-based models when 
considering unknown or highly dynamic environments. 
However, as demonstrated by the experiments in 
Section V-D, the robust satisfaction of communication 
constraints affords the possibility to rely on coarse models 
of point-to-point communication rates while still ensuring 
maintenance of quality-of-service requirements. 


Pe(Pn(xi,Xj)) = erfc 



P(Xi,Xj) 

P Nn 


( 8 ) 


where P^ 0 is the noise power, erfc(x) is the complemen¬ 
tary error function, and k is a constant that depends on 
modulation and coding [23]. The translation into commu¬ 
nication rates Rij(xi,Xj ) is now straightforward as it simply 
requires multiplying the information rate of transmitted 
packets Ro by the probability of successful decoding, i.e., 
Rij(xi,Xj) = Ro[l - p e (P R (xi,Xj))]. 

The randomness of the fading term T in (7) propagates 
into randomness in the error probabilities p e (P(x(,Xj)) and 
subsequently into randomness in the rates Rij(xi,Xj). To 
compute the mean and variance of R[j(xi,Xj ) we need to 
recall that T is a Gaussian random variable. We can then 
invoke the delta method [24] to approximate the mean and 
variance of R^x^Xj) in terms of the mean and variance of 
Pr (xj, Xj ) and the derivatives of the function p e (P R (xj,Xj)) 
in (8). The complete mapping from distance to link rates is 
depicted in Fig. 4. Observe that except for very short 
distances smaller than 3 m there is significant variability in 
achieved link rates. See [1, ch. 4] for details. 

While the model we present in this section describes 
the statistics of the supported rate R^x^Xj) in terms of 
distances ||x; — x ; -11, we emphasize that this is not a limi¬ 
tation of the proposed cyber-physical controller. In fact, 
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Fig. 4. Mapping from distances x, - xj \ between a transmitter at 
location x-, and a receiver at position xj to normalized link rates 

f^ij (X/ 5 Xj ) / Rq . 


IV. ROBUST ROUTING 

The control loop in Fig. 1 embeds a cyber optimization 
component and a physical reconfiguration module. The 
cyber optimization component optimizes communication 
routes for given positions, while the physical component 
explores spatial reconfigurations that maintain network 
integrity and are conducive to task achievement. The phy¬ 
sical module relies on the cyber module for the determi¬ 
nation of configurations that are feasible from the 
perspective of maintaining target communication rates 
with some desired reliability. 

The major difficulty in designing the cyber component 
of this control loop is the uncertainty in achievable trans¬ 
mission rates between nearby agents. As we discussed in 
Section III, simple experiments suffice to demonstrate that 
achievable rates in wireless links are difficult to predict. 
Therefore, assuming that the actual channel rates R i; -(x) 
coincide with their estimates Rfj(x) may result in a drastic 
difference between predicted and actual end-to-end rates. 
This mismatch subsequently leads to situations in which 
the communication optimization module asserts network 
integrity when actual rates do not suffice to support the 
task assigned by operators. 

A simple way to account for the uncertainty in R^(x) is 
to discount Ry(x) in order to reduce the likelihood of 
having actual rates smaller than the assumed value. This is 
definitely possible but would result in underutilization of 
communication resources. A better way to account for the 
uncertainty in channel rates is to recall that end-to-end 
rather than point-to-point failures are relevant. With re¬ 
ference to Fig. 2, say that robot i decides to convey infor¬ 
mation to k by forwarding it through L If the rate Rn(x ) 
happens to be substantially smaller than estimated, com¬ 
munication is interrupted. If, on the contrary, robot i de¬ 
cides to send part of the information through l and some 
other part through j, the communication between i and k 
survives as long as the rate R;j(x) is larger than estimated. 
This example illustrates that it is possible to exploit spatial 
redundancy through traffic splitting in order to devise ro¬ 
bust routes that guarantee small changes in end-to-end rates 
despite large variability in point-to-point rates Rij(x) [25]. 

To develop robust routing algorithms, start by noticing 
that computing the probability in (5), which is part of the 
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problem formulation in (6), necessitates modeling the 
probability distribution of aj^a., x). This is difficult in 
general. However, if we explicitly consider the stochastic 
model of point-to-point links via their means and va¬ 
riances, we can compute the mean and variance of end-to- 
end rates 0 ^( 0 ., x) as 


aj^ct, x) := E[af(a., x)] 


E a ^«- E 


af(a., x) := var|af(o*.,x) 


} jydest(fe) 

k, 


'V 11 

] j<0 dest(fe) 


( 9 ) 


E(a- ) Rij( x )+ E (“p) fy‘( x )- ( 10 ) 


A proxy for the probability in (5) is the difference between 
a| 2 (oy x) and its mean af(aL, x) normalized by its standard 

deviation x). Indeed, a typical approximation for 

the probability constraint in (5) is the condition 

P\( a, x) := -— . > ^ (e) (11) 

yaf(a,x) 

for some function T> _1 (e). If the probability distribution of 
rates R^x) is Gaussian, (11) is equivalent to (5) if T> _1 (e) is 
the inverse of the normal distribution’s cumulative 
distribution function. For other probability distributions, 
we can apply Chebyshev’s inequality with T> _1 (e) = yl/e 
to demonstrate that (11) is a sufficient condition for 
satisfying (5) though this will usually be a conservative 
approximation. 

For given positions x, the goal of the cyber optimiza¬ 
tion component is to find routing variables a. that satisfy 
(11) which is equivalent to, approximate to, or a sufficient 
condition to satisfy (5). In either case, there is some inde¬ 
terminacy because there is a nonunique set of variables ol 
that satisfy the corresponding inequality. This indetermi¬ 
nacy provides a degree of freedom that can be used to 
increase the reliability beyond the required level. For a 
given configuration x, we would like to find routes 
a. = ct(x) that provide the maximum possible reliability. 
Doing so is not conceptually difficult—it requires determi¬ 
nation of routes ct(x) that maximize the probability pro¬ 
xies Pf( ol, x) in (11) in some sense. Considering that there 
is a rate variable af for each source-destination pair, there 
are several possible choices to define this maximization. 
One possibility is to maximize the sum of these probability 
proxies. Another possibility is to select routes ct(x) that 
make the smallest of these terms as large as possible. 
Maximizing this minimum implies that the constraints in 
(11) are satisfied with significant slack and that there is 


significant liberty to change the physical configuration 
without violating communication constraints. This free¬ 
dom of movement facilitates implementation of the physi¬ 
cal mobility control block as we discuss in Section V. 

An important observation here is that the probability 
proxy constraints in (11) can be written in a manner that 
defines a cone in the joint space of routing variables and 
end-to-end rates [1, ch. 5]. A consequence of this property 
is that the optimization problems that need to be solved to 
maximize them are second-order cone program (SOCPs). 
SOCPs are a particular class of convex optimization 
problem that can be solved by efficient polynomial-time 
algorithms. For the problems considered here, the compu¬ 
tational complexity of these algorithms is represented as a 
polynomial function of the number of agents N and the 
number of flows (destinations) K, as O ((K ■ N 2 ) ’ ). In 
practical implementations, the N 2 term can be reduced by 
eliminating links for which rate estimates Rij(x) are below 
a certain threshold. 

Recall that the motivation for robust routing algorithms 
is to reduce uncertainty in end-to-end communication 
rates. We do so by taking advantage of spatial redundancy, 
for which it is necessary to split traffic among various 
different routes. It is fitting that we do expect to obtain this 
type of solution from the maximization of the probability 
proxies Pf(oL,x) in (11). Indeed, increases in these terms 
can be brought about by either increasing the mean 
d^(oL,x) or decreasing the variance af(a., x). Since the 
mean is a linear function of a,, traffic splitting has a minor 
effect on af( ol,x). However, traffic splitting reduces 
af((x,x) by a factor proportional to the splitting because 
the variance is a quadratic function of ol —recall that 
a t j < 1. Thus, traffic splitting tends to increase the proba¬ 
bility proxies because it keeps a| 2 (a,,x) more or less con¬ 
stant and reduces a } -(o *., x) significantly. 

V. CONTROL 

As per (6), the objective of mobility control is to decrease 
\F(x) while satisfying the probability proxy constraints in 
(11)—which are equivalent or approximations to 
P[a| 2 (a.,x) > a| 2 mill ] > e. To check for the feasibility of a 
cyber-physical configuration (ot,x), we define the prob¬ 
ability margin as the minimum slack in probability proxy 
constraints across all flows and sources 



Notice that a necessary and sufficient condition for 
feasibility of the physical configuration x is to have 
i/( ot(x),x) > 0, with routing variables ct(x) as given by 
the solution of the SOCP described in Section IV. A 
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sufficient condition for feasibility of physical configuration 
x' is the existence of a cyber configuration ol for which 
x.') > 0. In particular, for x' close to x, we expect to 
have z/(ct(x), x') > 0 since the channel statistics at x and 
x' are close. 

A. Gradient Controller 

In general, gradient controllers define velocities x(t) 
proportional to the negative gradients — V\F(x(t)) of the 
task potential. Since the problem in (6) is subject to com¬ 
munication constraints, a local controller will be based on 
potential gradients projected onto the feasible set 
is(ol x) > 0. The complex description of the feasible set, 
however, precludes computation of projected gradients. 
Instead, we consider the probability margin z/(ol(x),x) 
and modify the potential 4/(x) by adding the probability 
margin constraint into the objective through a barrier 
function 



Fig. 5. Deployment via local control law (14) for a system with a 
fixed access point, relay node Xi and lead node x 2 , which is 
controlled by a time-varying task potential 'Ppc(f)). 


ft(x) := ’J'(x) - logO(tt(x),x)). (13) 


Since nonnegativity is necessary and sufficient for feasibi¬ 
lity of physical configuration x, the potential f2(x) in (13) 
is defined if and only if physical configuration x is feasible. 

The local control law is defined to implement gradient 
descent on the modified potential fl(x) introduced in (13), 
which, in explicit terms, is given by 



-W(x(t)) + 


V x z/[oi(x(t)),x(t)] 

z/[a(x(t)),x(t)] 



The term V\F(x(t)) in (14) drives the system to satisfy the 
task potential. The term V x z^[a(x(t)),x(t)]/z/[a.(x(t)), 
x(t)] serves as a barrier that drives robots away from con¬ 
figurations for which there is a low probability of exceed¬ 
ing the desired reliability in end-to-end rates. 


B. Simulation Results 

We implement the local controller with mobility con¬ 
trol inputs given by (14) and communication variables 
obtained from Section IV. Computing controls based on 
local optimization of the network-level end-to-end rates 
allows for a method of realizing team deployment while 
maintaining the necessary level of network connectivity. 
Fig. 5 depicts an example deployment with three robots for 
a time-varying task potential 


| -^2,goal — (4, 0), t < 40 
^(x(t)) = < X 2 , goal = (8,0), t < 60 (15) 

l -^2,goal (0,5), t 60. 


Most importantly, this example demonstrates conver¬ 
gence of the task potential \F(x) while maintaining 
P[cij > d^min] > e as depicted in Fig. 6(a) and (c). How¬ 
ever, it is also interesting to observe that when the task 
potential is minimized, e.g., t < 30 s, the local control law 
(14) maximizes the probability of each end-to-end rate 
exceeding its minimum threshold. When the task potential 
switches so that 4/(x) is no longer minimized, the proba¬ 
bility margin is reduced so that the primary objective, 
minimization of 4/(x), is prioritized. Finally, Fig. 6(b) de¬ 
picts the end-to-end rate of the node X2 that must remain 
above a2, m in = 0.1. Remember that Rij(x) is a stochastic 
rate that affects the end-to-end rate. The envelope around 
q .2 in Fig. 6(b) depicts the effect that different realizations 
of communication channels R^-(x) will have on the end-to- 
end rate. Since the pursuit of minimization on \F(x) is 
constrained to have a probability margin z/(ql(x),x) > 0, 
the end-to-end rate exceeds its threshold in the presence of 
deviations to Rij(x). 

We perform a four robot simulation, depicted in Fig. 7, 
to demonstrate how the complexity of the objective func¬ 
tion (13) increases with more nodes. Similar to the three 
robot simulation above, the task potential function is time 
varying so that 4/(x) = 0 for t < 50 s and 4/(x) = ||x 3 — 
(9, 0)|| for t > 50 s. When t < 40 and the task potential is 
already minimized, control of Xi and X 2 is based solely on 
the maximization of log is(ol(x), x). From the symmetric 
initial configuration, the maximization drives Xi and X 2 
towards a local maxima where they would be positioned at 
the same point. In fact, if we consider all configurations 
where Xi < X 2 and both agents are on the line connecting 
the access point, we find that there are two local maxima as 
depicted in Fig. 8. 

To further illustrate the virtual terrain of the objective 
function (13) that drives local control, we observe the 
convergence of 4/(x(t)) as depicted in Fig. 9(a). For 
times 70 s < t < 90 s, convergence of \F(x(t)) slows, 
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Fig. 6. Performance of the local control law (14) demonstrating 
convergence of X \ r (x(t)) in (a), the maintenance of expected end-to-end 
rate greater than the threshold of a 2 , m in = 0.1 in (b), and the 
P[a, > a, min] > 0.6. The envelope surrounding ( a 2 in (b) depicts the 
60% confidence interval for realizations of the end-to-end rate 
with stochastic /?,y(x). 


i/(ol(x),x) as depicted in Fig. 9(c) shows little change, 
and x\ and X 2 cease progress. Fortunately, perturbations in 
the simulation are enough to pull the system out of the 
local minima so that it can achieve a global minimum at 
t = 140 s. As the number of agents increases, the fre¬ 
quency of local minima in — j/(ol(x), x) becomes more and 
more of an issue for local control. The addition of obstacles 
into the environment adds further difficulties as it not only 


Probability margin, .vj =5. 



r l 


Fig. 8. Evaluation of v{ a(x),x), Vx e {x = {xi,x 2 ,x 3 ) : Xi < x 2 < x 3 , 

x 3 = (5, 0), x lf and x 2 online connecting access point and x 3 }. 
Local maxima occur at x x = x 2 = 1.9 m and Xi IS m f x 2 3 m. 


affects feasible configurations x due to collision con¬ 
straints, but also introduces nonsmooth components in the 
underlying point-to-point communication links Rjj(x). 


C. Global Planning 

Gradient-based control will drive the system towards 
local minima of (13). However, as we consider larger 
teams with more complicated network topologies and 
complex environments with obstacles, local minima be¬ 
come more of an issue. We propose that a global search of 
(6) is necessary in order to accomplish the high-level 
situational-awareness tasks we are interested in. 

To consider global search of (6), we redefine the prob¬ 
lem to be more amenable to motion planning approaches 
from the robotics literature. Let X be a bounded connected 
open subset of R 2N that represents the full joint state space 
for the team of robots where Xq is the initial configuration 
of the team. In general, the goal region will be defined as 
X g = { x : 'J(x) < \F m i n + <5}. In the telepresence applica¬ 
tion where a lead agent must visit the location x^ g , \F(x) = 
\x£ — X£g\\ 2 and X g = {x : \\x£ — x^ g || < b}. The obstacle 
region X 0 b s contains any configuration that places an 
individual robot on a physical obstacle and the infeasible 





Fig. 7. Snapshots from a four robot trial. The end-to-end rates and probability of meeting the problem specifications are depicted in Fig. 9. 
^(x(f)) = ||X3 — (9, 0) || for t > 50 s and ^(x(f)) = O for t <50 s: (a) t = 40 s; (b) t = 75 s; (c) t = 140 s. 
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Fig. 9. fal Convergence of'P(x), (b) end-to-end rates, and 

(c) probability of success for the four robot trial depicted in Fig. 7. 


region represents configurations where it is infeasible to 
satisfy the network constraint (11) 


explore the workspace, adding states in a tree structure T 
until a point x G X g is added to the tree T. At each step 
of the RRT algorithm, we pick a random state x = 
RANDOMSTATE(X, T) and select the point x m j n = 
NEAREST(T, x) that is closest to x among those that 
have already been added to T. We then attempt a virtual 
drive from x m i n to x using the subroutine x = 
EXTEND (x m i n , x). The point x is the first intersection 
of this virtual path with the border of the free space region 
Xf ree , or, if the border is not reached, the random point x. 
The point x is then added to the tree T as a branch con¬ 
nected to the point x m i n that was closest to x among the 
preexisting elements of the tree. The algorithm terminates 
when a point x G X g in the goal configuration is added to 
the tree. 

A common problem encountered when applying RRT 
algorithms to high-dimensional state spaces is that com¬ 
putation of Nearest is inefficient for increasing tree sizes. 
We adopt the strategy of storing the tree T in a KD-tree 
data structure which stores states in R d by recursively 
subdividing based on alternating axis-aligned hyperplanes 
[27]. This enables approximate nearest neighbor calcula¬ 
tions that maintain performance even as the dimension 
increases. However, there are two additional difficulties 
that arise when applying standard RRT algorithms to solve 
the specific high-dimensional network connectivity prob¬ 
lem in (6): 1) the verification of feasible states as Extend is 
used to expand the tree towards x; and 2 ) the prohibitive 
cost of uniformly exploring Xf ree for our high-dimensional 
problem with slow-to-compute constraints. We discuss this 
two issues in the following two sections. 



1) Efficient Verification of Feasible States: The 
EXTEND (xi, X 2 ) algorithm attempts to virtually drive 
the system from Xi towards X 2 by successively verifying 
that points along the line connecting Xi and x 2 are inXf ree . 
It returns the state x as the closest state to x 2 such that all 
states sampled with precision Ax between Xi and x 2 are 
in Xf ree . I n traditional motion planning applications, 


The free space Xf ree is thenX \ (X 0 b s U Xi n f). Finally, a path 
in X is parameterized by a scalar s > 0 and given by 
cr : [0, s] —> X. A feasible path, and solution to our global- 
planning problem, is then cr : [0,s] —» Xf ree such that 
cr(0) = Xfrfrt and cr(s) G X g . 

The dimensionality of our problem and the high com¬ 
putational cost of verifying a state as in Xf ree makes deter¬ 
ministic search algorithms impractical. Instead we turn to 
probabilistic search methods that offer good space filling 
properties and efficient exploration of an unknown space 
like the rapidly exploring random tree (RRT) algorithms 
[26]. The basic structure of an RRT, as depicted in Fig. 10, 
is to start with an initial point Xq and expand to fully 



/ Irf 


Fig. 10. Graphical depiction of the RRT search process visualized in R 2 . 
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verification that x E Xf ree is based on an algebraic con¬ 
straint or collision query with a multitude of efficient 
methods for doing so [28]—[30]. While the necessary com¬ 
putation to determine x 0 X Q bs is typically small, compu¬ 
tation of x ^ Xi n f requires a solution of the SOCP in 
Section IV and can be costly for high-dimensional systems. 

Consequently, we store ct(x) for every node in T and 
recompute ol(x) only when necessary to extend new 
states. By relying on the fact that an optimal robust routing 
solution ct(x) will be feasible for neighboring states, it is 
often possible to extend x towards x without the costly 
overhead of numerical optimization. 


2) Biased Space Sampling: Random states x are chosen to 
sample the space X C R 2N according to a probability dis¬ 
tribution p x ( x ) representing the belief about configuration 
x being part of a feasible path cr(s). If nothing is known 
about cr(s), we choose p x ( x ) uniform in the space X. In 
general, at least the final configuration is known in that 
cr(s) E X g . We can then bias the distribution by designing 
p x (x) to choose configurations in X g with higher proba¬ 
bility. Goal biasing improves efficiency of RRT algorithms 
by reducing the number of samples necessary to find a 
feasible path cr(s) in the high-dimensional space X C R 2N . 

In many cases of interest, however, the volume of X g is 
comparable to the volume of X and goal biasing offers little 
improvement over uniform sampling. In, for example, the 
telepresence application, the goal position of the leader x^ g 
is known, but the positions of the remaining robots are 
free. Thus, goal biasing would reduce the exploration cost 
along the components associated with X£ but keep the cost 
of exploring the remaining 2(N — l) dimensions fixed. To 
further reduce exploration cost in this case we construct a 
prediction X g C X g of the final configuration and bias 
sampling towards this prediction. 

Constructing a final configuration prediction X g is task 
specific. We describe here a method applicable to the tele¬ 
presence application. To determine the configuration pre¬ 
diction X g we determine configuration predictions X i:g for 
each robot and compute X g as the Cartesian product of 
these individual sets, i.e., X g = fl^Xi g . Notice that for the 
lead robot we can makeX^g = {x£ E R 2 : \\x£ — x^ g \\ < <5}. 

Observe now that X C R 2N is the Cartesian product 
X = Yii=i % of the N decoupled spaces X* E R corre¬ 
sponding to each individual robot. If we further assume a 
homogeneous team of robots, then all robots operate in the 
same space X; = Y, with a common set of physical ob¬ 
stacles Y 0 bs, and consequently a common free space Yf ree = 
Y \ Y 0 b s . It follows that the joint free space Xf ree is also a 
Cartesian product of N identical sets Yf ree minus those 
configurations for which a network cannot be established 
with sufficient reliability 



(Yf ree ) N \ Xinf. 



While infeasible network configurations are captured by 
Xinf as given in (16), Xf ree can otherwise be described by 
the free space of individual robots. 

To exploit this observation, we first determine an 
obstacle-free path 7 : [ 0 , s] —> R 2 such that 7 ( 0 ) = xo is 
the position of the operating center and 7 (s) E X^ g . The 
obstacle-free path 7 : [0, s] —» Yf ree is split into N — 1 equal 
length segments %. The ith robot is then assigned to a 
segment by the function k(i) based on Euclidian distance 
to its midpoint. Segments are enlarged to define the region 
Xj g for i 7 ^ 0 , L Since this is a heuristic for the goal 
configuration, the only requirement on X gg is that 
7 k(i) : [0,s] —■» X i>g . A typical choice is 




: min 

s 




where d g is a parameter controlling the enlarged size of 
X ijg . The predicted final configuration is then computed as 
the Cartesian product X g = ]^[|L 1 X i g- The construction of 
Xg described above is based on the heuristic that a feasible 
goal configuration in an environment with obstacles will 
resemble a line-of-sight communication chain. Increasing 
the size of X g with large values of d g limits the implication 
of this assumption. 


D. Experimental Results 

The randomized motion planner is able to find 
feasible configurations that allow target servicing at 



Fig. 11. The global planning task for five robots in a complex 
environment. We depict the series of waypoints that x 5 must visit and 
the initial conditions for , x 2 , x 3 , x 4 . The inset images depict our 
experimental system deployed in the environment. 
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(d) 



Time i*l 


Fig. 12. The end-to-end rates of the nodes during the global planning 
experiment depicted in Fig. 11. In each plot, the shaded envelope 
depicts a, and the variations that occur with probability e = 0.75 based 
on a, while the dashed black line depicts the instantaneous rate a,. 
Note that a 5 min = 0.25 while all other a ; min = O. 


positions not attainable with the local control approach 
from Section V-A. It can additionally provide a feasible 
sequence of configurations to get to the target configura¬ 
tion. To test the global planner on a system with one fixed 
access point and five robots, we introduce a sequence of 
task potential functions Tq(x), d/ 2 (x),..., d/ M (x) that 


require the lead node X5 to visit a sequence of positions 
while the remaining four robots act as relays to support 
end-to-end communication with the access point of 
^5, min = 0.25. We require that this end-to-end rate be 
satisfied with probability e = 0.75. The problem is made 
more complicated by the introduction of obstacles that not 
only block robot motions but also degrade received signal 
strength in (7) by 7.6 dBm when line-of-sight is lost. 

The global planning algorithm described in Section V-C 
is queried to find a feasible path cr* : [0, s] —> R 10 for each 
task \fq(x) in order such that <Tj(0) = <7j_i(s) and a i(0) = 
Xi n it. The trajectory of the lead node X5 for the concate¬ 
nation of paths a = <Ji \ .. . |<Jm is depicted in Fig. 11. After 
solving for a feasible path, x des (t) is computed so that x(t ) 
follows a. 

We conduct an experimental deployment of five robots 
controlling the feasible trajectory a for the problem de¬ 
picted in Fig. 11. In this experiment, each robot is 
equipped with a 2.4-GHz Zigbee radio that is used to 
broadcast and receive packets in order to measure in¬ 
stantaneous received signal strength that can be used to 
estimate the supported communication rate Rij(t) between 
node i and j at time t. Using these measurements, in con¬ 
junction with the network routing solution cx(t), we can 
estimate the actual supported end-to-end rate at time t for 
each node i, di(a.(t), x(t)). The metric for the performance 
of our approach is then that each di(a.(t),x(t)) > a ijin i n 
with probability e across the duration of the trial. 

Fig. 12 depicts the predicted d;(a,(t), x(t)) and mea¬ 
sured di(c\(t), x(t)) end-to-end rate of each node for the 
duration of the experiment. First, note that over the entire 
trajectory produced with our global planning methods, the 
prediction of end-to-end rates always satisfies the problem 
specification that P[a;(oyx) > a i?m i n ] > e. Second, when 
we examine the actual end-to-end rates based on real 
measurements of point-to-point wireless channels we see 
that it is generally true that di(a,(t), x(t)) > a^ m i n . In fact, 
if we analyze this constraint across the duration of the 
experiment, we find that the fraction of time spent below 



Fig. 13. Snapshots of the network configuration during the experimental deployment based on global planning to satisfy the task in Fig. 11. 
The end-to-end rate performance is depicted in Fig. 12. (a) t = IOO s ; (b) t = 278 s; (c) t = 482 s; (d) t = 622 s. 
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Fig. 14. Running time for a benchmark planning task computed with 
an increasing number of robots N and different end-to-end rate 
thresholds a mjn . Since the global planning algorithm is randomized , 
each point is the average running time over ten trials . 


the minimum threshold for each of the instantaneous rates 
di, eh, (23, £4, and ds is 9.2%, 0.8%, 0.3%, 0.6%, and 2.9%, 
respectively. 

Representative configurations from the experiment are 
depicted in Fig. 13. In Fig. 13, at t = 100 s, the predicted 
goal state X g assumes the shortest line-of-sight path which 
is the left hallway, i.e., a similar result to the reactive 
methods in our local control algorithm. As the system 
transitions to Fig. 13(b), where the lead node X5 has been 
tasked to a waypoint in the right hallway, the prediction for 
X g shifts to a chain of relays going through the right 
hallway. This shift in the basic topology of X g focuses on 
exploration of the joint state space so that X4 moves 
towards a configuration that will lower the performance of 
the network over the short term. As node X5 completes the 
desired loop, it utilizes X4 as a relay channel and is able to 
maintain the desired end-to-end rate. It is this dramatic 
shift in network topology that highlights the advantage of 
our global planning approach as we are able to accomplish 
continuous end-to-end rate maintenance that would not be 
possible with a purely reactive method. 

With regards to the running time of the randomized 
global planning algorithm, we note that it is difficult to 
characterize the performance of randomized search algo¬ 
rithms. One factor is the complexity of Xf ree which is de- 
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